article thumbnail

Modular Deep Learning

Sebastian Ruder

For modular fine-tuning for NLP, check out our EMNLP 2022 tutorial. d) Hypernetwork: A small separate neural network generates modular parameters conditioned on metadata. We provide a high-level overview of some of the trade-offs of the different computation functions below. For a more in-depth review, refer to our survey.

article thumbnail

ACL 2022 Highlights

Sebastian Ruder

Seeing the emergence of such multilingual multimodal approaches is particularly encouraging as it is an improvement over the previous year’s ACL where multimodal approaches mainly dealt with English (based on an analysis of “multi-dimensional” NLP research we did for an ACL 2022 Findings paper ). Hershcovich et al.

NLP 52
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

The State of Multilingual AI

Sebastian Ruder

At the same time, a wave of NLP startups has started to put this technology to practical use. I will be focusing on topics related to natural language processing (NLP) and African languages as these are the domains I am most familiar with. This post takes a closer look at how the AI community is faring in this endeavour.

article thumbnail

All Languages Are NOT Created (Tokenized) Equal

Topbots

Language Disparity in Natural Language Processing This digital divide in natural language processing (NLP) is an active area of research. 70% of research papers published in a computational linguistics conference only evaluated English.[ Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold.

article thumbnail

Accelerate hyperparameter grid search for sentiment analysis with BERT models using Weights & Biases, Amazon EKS, and TorchElastic

AWS Machine Learning Blog

Sentiment analysis and other natural language programming (NLP) tasks often start out with pre-trained NLP models and implement fine-tuning of the hyperparameters to adjust the model to changes in the environment. She has a technical background in AI and Natural Language Processing.

BERT 75