Skip to content

Latest commit

 

History

History
56 lines (41 loc) · 2.44 KB

README.md

File metadata and controls

56 lines (41 loc) · 2.44 KB

GitHub Twitter @data4sci GitHub top language GitHub repo size GitHub last commit

Data For Science Substack Data Science Briefing

Binder

NLP with PyTorch

Code and slides to accompany the online series of webinars: https://data4sci.com/nlp-with-pytorch by Data For Science.

Run the code in Binder: Binder

Natural Language lies at the heart of current developments in Artificial Intelligence, User Interaction, and Information Processing. The combination of unprecedented corpora of written text provided by social media and the massification of computational power has led to increased interest in the development of modern NLP tools based on state-of-the-art Deep Learning tools.

In this course, participants are introduced to the fundamental concepts and algorithms used for Natural Language Processing (NLP) through an in-depth exploration of different examples built using the PyTorch framework for deep learning. Applications to real datasets will be explored in detail.

Schedule

1. Foundations of NLP

  • One-Hot Encoding
  • TF/IDF and Stemming
  • Stopwords
  • N-grams
  • Working with Word Embeddings

2. Neural Networks with PyTorch

  • PyTorch review
  • Activation Functions
  • Loss Functions
  • Training procedures
  • Network Architectures

3. Text classification

  • Feed Forward Networks
  • Convolutional Neural Networks
  • Applications

4. Word Embeddings

  • Motivations
  • Skip-gram and Continuous Bag of words
  • Transfer Learning

5. Sequence Modeling

  • Recurrent Network Networks
  • Gated Recurrent Unit
  • Long-Short Term Memory
  • Encoder-Decoder Models
  • Text Generation