Blog - Artificial Intelligence Zone

Clustering with Scikit-Learn: a Gentle Introduction

Towards AI

FEBRUARY 23, 2024

Learn how to apply state-of-the-art clustering algorithms efficiently and boost your machine-learning skills. I will present the theory of the most widely used clustering models, and we will understand how to practically implement them with Scikit-Learn. As… Read the full blog for free on Medium.

Machine Learning

Machine Learning Data Scientist Algorithm Data Science

Meet the Fellow: Umang Bhatt

NYU Center for Data Science

JUNE 16, 2023

This entry is a part of our Meet the Fellow blog series, which introduces and highlights Faculty Fellows who have recently joined CDS. Meet CDS Assistant Professor/Faculty Fellow Umang Bhatt, who will join CDS this fall. For these reasons, I am excited to start my academic journey at NYU.

Machine Learning

Machine Learning Explainability Artificial Intelligence

Automate PDF pre-labeling for Amazon Comprehend

AWS Machine Learning Blog

DECEMBER 14, 2023

To reduce the effort of preparing training data, we built a pre-labeling tool using AWS Step Functions that automatically pre-annotates documents by using existing tabular entity data. Solution overview In this section, we discuss the inputs and outputs of the pre-labeling tool and provide an overview of the solution architecture.

Automation

Automation Natural Language Processing Machine Learning Deep Learning

Google Research, 2022 & beyond: Algorithms for efficient deep learning

Google Research AI blog

FEBRUARY 7, 2023

The explosion in deep learning a decade ago was catapulted in part by the convergence of new algorithms and architectures, a marked increase in data, and access to greater compute. Below, we highlight a panoply of works that demonstrate Google Research’s efforts in developing new algorithms to address the above challenges.

Deep Learning

Deep Learning Algorithm Neural Network ML

Understanding Deep Learning Algorithms that Leverage Unlabeled Data, Part 1: Self-training

The Stanford AI Lab Blog

FEBRUARY 24, 2022

Deep models require a lot of training examples, but labeled data is difficult to obtain. For example, large quantities of unlabeled image data can be obtained by crawling the web, whereas labeled datasets such as ImageNet require expensive labeling procedures. Chen et al., 2020, Sohn et al.,

Deep Learning

Deep Learning Algorithm Explainability

Google at ICLR 2023

Google Research AI blog

APRIL 30, 2023

If you’re registered for ICLR 2023, we hope you’ll visit the Google booth to learn more about the exciting work we’re doing across topics spanning representation and reinforcement learning, theory and optimization, social impact, safety and privacy, and applications from generative AI to speech and robotics.

Neural Network

Neural Network Large Language Models Machine Learning Deep Learning

Bundesliga Match Facts Shot Speed – Who fires the hardest shots in the Bundesliga?

AWS Machine Learning Blog

NOVEMBER 3, 2023

To achieve this, our process uses a synchronization algorithm that is trained on a labeled dataset. This algorithm robustly associates each shot with its corresponding tracking data. Shot speed calculation The heart of determining shot speed lies in a precise timestamp given by our synchronization algorithm.

Data Scientist

Data Scientist Algorithm Data Science Machine Learning

Paper Summary #8 - FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Shreyansh Singh

MARCH 26, 2023

There are many approximate attention methods out there like Reformer, SMYRF, Performer and others (you can find more details on a few of these in my previous blog) which aim to reduce the compute requirements to linear or near-linear in sequence length, but many of them do not display wall-clock speedup against standard attention.

Algorithm

Algorithm BERT Explainability

Stanford AI Lab Papers and Talks at NeurIPS 2021

The Stanford AI Lab Blog

DECEMBER 6, 2021

We’re excited to share all the work from SAIL that’s being presented at the main conference, at the Datasets and Benchmarks track and the various workshops, and you’ll find links to papers, videos and blogs below.

Neural Network

Neural Network Deep Learning Machine Learning AI

Explain medical decisions in clinical settings using Amazon SageMaker Clarify

AWS Machine Learning Blog

AUGUST 21, 2023

Background One specific application of ML algorithms in the medical domain, which uses large volumes of text, is clinical decision support systems (CDSSs) for triage. Predictions of these are now highly achievable from admission notes alone, through the use of natural language processing (NLP) algorithms [1].

Explainability

Explainability BERT ML NLP

Continual Learning: Methods and Application

The MLOps Blog

FEBRUARY 22, 2024

Every input example needs a task label that helps identify your expected output. For instance, outputs in classification and text summarization problems are different, so based on the task label, you can decide if the current example trains classification or extraction.

Continuous Learning

Continuous Learning Machine Learning ML Neural Network

The Intuition behind Adversarial Attacks on Neural Networks

ML Review

MARCH 31, 2019

Up to this point, machine learning algorithms simply didn’t work well enough for anyone to be surprised when they failed to do the right thing. The reason it works is that unlike the first model, the second model is trained on the primary model’s “soft” probability outputs, rather than the “hard” (0/1) true labels from the real training data.

Neural Network

Neural Network Machine Learning Deep Learning Explainability

Google at ICML 2023

Google Research AI blog

JULY 23, 2023

Posted by Cat Armato, Program Manager, Google. Groups across Google actively pursue research in the field of machine learning (ML), ranging from theory to application. We build ML systems to solve deep scientific and engineering challenges in areas of language, music, visual processing, algorithm development, and more.

Neural Network

Neural Network Machine Learning Large Language Models Algorithm

Semi-supervised Deep Learning for Medical Image Segmentation

Heartbeat

FEBRUARY 28, 2023

Finally, we will look at some of the recent semi-supervised medical image segmentation algorithms. SSL is a machine learning paradigm that combines a very small amount of labeled data along with a large amount of unlabelled data for training. Let’s dive in! What is Semi-supervised Learning (SSL)?

Deep Learning

Deep Learning Neural Network Algorithm Machine Learning

Grading Complex Interactive Coding Programs with Reinforcement Learning

The Stanford AI Lab Blog

MARCH 28, 2022

[Summary] tl;dr: A tremendous amount of effort has been poured into training AI algorithms to competitively play games that computers have traditionally had trouble with, such as the retro games published by Atari, Go, DotA, and StarCraft II. Can the same algorithms that master Atari games help us grade these game assignments?

Algorithm

Algorithm Auto-classification Automation Auto-complete

Cleanlab CEO shows automatic data-cleansing tools

Snorkel AI

FEBRUARY 17, 2023

First I’ll chat a bit about millions of label errors and the 10 most common machine learning benchmark data sets. This is built on a theory that we developed at MIT called Confident Learning, which is a subfield of machine learning for learning with noisy labels, finding errors in data, and estimating uncertainty in data.

Machine Learning

Machine Learning Algorithm AI

How front-end development can improve Artificial Intelligence

Explosion

AUGUST 21, 2016

While researchers rightly focus on better algorithms, there are a lot more things to be done. Data Collection and Training: Contrary to popular belief, the bottleneck in AI is data, not algorithms. This suggests a tempting theory: annotation time should be dirt cheap, right? What’s holding back Artificial Intelligence?

Artificial Intelligence

Artificial Intelligence Algorithm NLP

Artificial Neural Networks in Machine Learning

Mlearning.ai

FEBRUARY 25, 2023

How does the Artificial Neural Network algorithm work? The ANN approach is a machine learning algorithm inspired by biological neural networks. While classical machine learning algorithms fell short of analyzing big data, artificial neural networks performed well on big data. This is where the backpropagation algorithm comes in.

Neural Network

Neural Network Machine Learning Deep Learning Convolutional Neural Networks

A Deep Dive into Variational Autoencoders with PyTorch

PyImageSearch

OCTOBER 2, 2023

Figure 2: Variational Autoencoder Architecture Diagram (source: image created by the author for the LearnOpenCV Blog). Figure 3: The reparameterization trick transforms a stochastic node into a deterministic one, facilitating gradient flow (source: image designed by the author for the LearnOpenCV Blog). The config.py

Computer Vision

Computer Vision Deep Learning Neural Network Auto-complete

Supervised learning is great — it's data collection that's broken

Explosion

APRIL 1, 2017

Prodigy features many of the ideas and solutions for data collection and supervised learning outlined in this blog post. Most AI systems today rely on supervised learning: you provide labelled input and output pairs, and get a program that can perform analogous computation for new data. Try Prodigy!

Algorithm

Algorithm Machine Learning Deep Learning Python

On Privacy and Personalization in Federated Learning: A Retrospective on the US/UK PETs Challenge

ML @ CMU

MAY 12, 2023

In short, this says that the $k$-th data silo may set its own $(\varepsilon_k, \delta_k)$ example-level DP target for any learning algorithm with respect to its local dataset. Of course, the problem lies in what that feature vector (and the corresponding label) is; we’ll get to this in the following section.

Explainability

Explainability Algorithm Neural Network ML

Google at NeurIPS 2022

Google Research AI blog

NOVEMBER 28, 2022

Organizing Committee. General Chairs include: Sanmi Koyejo. Program Chairs include: Alekh Agarwal. Workshop Chairs include: Hanie Sedghi. Tutorial Chairs include: Adji Bousso Dieng, Jessica Schrouff. Affinity Workshop Chair: Adji Bousso Dieng, Jessica Schrouff. Program Committee, Senior Area Chairs include: Corinna Cortes, Claudio Gentile, Mohammad (..)

Neural Network

Neural Network Machine Learning Large Language Models Algorithm

sense2vec reloaded: contextually-keyed word vectors

Explosion

NOVEMBER 21, 2019

al, 2015) is a twist on the word2vec family of algorithms that lets you learn more interesting word vectors. sense2vec reloaded: the updated library sense2vec is a Python package to load and query vectors of words and multi-word phrases based on part-of-speech tags and entity labels. It was a nice idea in theory.

NLP

NLP Convolutional Neural Networks Neural Network Natural Language Processing

How to See Like a Machine

Mlearning.ai

JUNE 5, 2023

A Guide to Computer Vision Tools Hello and welcome to my blog on computer vision tools! In this blog, I will introduce you to some of the most popular and powerful computer vision tools that you can use to unleash your creativity and have fun. Learn the basics of computer vision theory. Let’s get started!

Computer Vision

Computer Vision Deep Learning Python Neural Network

AI Distillery (Part 2): Distilling by Embedding

ML Review

MARCH 5, 2019

Word embeddings Visualisation of word embeddings in AI Distillery Word2vec is a popular algorithm used to generate word representations (aka embeddings) for words in a vector space. Then, the algorithm proceeds with the following word as the new centre word, i.e. “learning”, sets up the new context, and repeats the same procedure.

AI

AI Computer Vision Computational Linguistics
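
The Word2vec teaser above describes taking each word in turn as the centre word, setting up its context, and repeating. A minimal sketch of that sliding step follows, assuming a skip-gram style pairing and a symmetric window; the function name `skipgram_pairs` and the `window` parameter are hypothetical and not taken from the linked post.

```python
def skipgram_pairs(tokens, window=2):
    """Return (centre, context) pairs by sliding the centre word over tokens.

    Assumption: a symmetric context window of `window` words on each side,
    as in the skip-gram formulation of Word2vec.
    """
    pairs = []
    for i, centre in enumerate(tokens):
        # Clip the window at the sentence boundaries.
        start = max(0, i - window)
        end = min(len(tokens), i + window + 1)
        for j in range(start, end):
            if j != i:  # the centre word is not its own context
                pairs.append((centre, tokens[j]))
    return pairs


if __name__ == "__main__":
    for pair in skipgram_pairs(["deep", "learning", "rocks"], window=1):
        print(pair)
```

These pairs are only the training examples; learning the embeddings themselves would require a model trained on them, which this sketch leaves out.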

Parsing English in 500 Lines of Python

Explosion

DECEMBER 17, 2013

I wrote this blog post in 2013, describing an exciting advance in natural language understanding technology. Today, almost all high-performance parsers are using a variant of the algorithm described below (including spaCy). We could, in theory, have written our guidelines so that the “correct” parses were reversed.

Python

Python Algorithm NLP Computational Linguistics

Deployment of Data and ML Pipelines for the Most Chaotic Industry: The Stirred Rivers of Crypto

The MLOps Blog

DECEMBER 7, 2022

Given that the whole theory of machine learning assumes today will behave at least somewhat like yesterday, what can algorithms and models do for you in such a chaotic context? And that includes data. Next comes a stage of hyperparameter tuning for the models. Then comes a very exhaustive evaluation stage.

ML

ML ETL Data Scientist Automation

Effectively solve distributed training convergence issues with Amazon SageMaker Hyperband Automatic Model Tuning

AWS Machine Learning Blog

JULY 13, 2023

Another way can be to use an AllReduce algorithm. For example, in the ring-allreduce algorithm, each node communicates with only two of its neighboring nodes, thereby reducing the overall data transfers. Train a binary classification model using the SageMaker built-in XGBoost algorithm.

Algorithm

Algorithm Deep Learning Neural Network ML
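
The ring-allreduce teaser above says each node talks only to its neighbours. A minimal single-process simulation of that idea is sketched below, assuming each node holds one scalar chunk per node and that the reduction is a sum; the function name `ring_allreduce` is hypothetical, and this is not the SageMaker or any library implementation.

```python
def ring_allreduce(node_chunks):
    """Simulate a sum ring-allreduce over n nodes, each holding n chunks.

    Node i only ever sends to node (i + 1) % n. Returns the per-node data
    after the reduce-scatter and all-gather phases.
    """
    n = len(node_chunks)
    data = [list(chunks) for chunks in node_chunks]

    # Phase 1 (reduce-scatter): after n - 1 steps, node i holds the full
    # sum for chunk (i + 1) % n.
    for step in range(n - 1):
        # Snapshot all messages first so sends within a step are simultaneous.
        sends = [(i, (i - step) % n, data[i][(i - step) % n]) for i in range(n)]
        for i, chunk, value in sends:
            data[(i + 1) % n][chunk] += value

    # Phase 2 (all-gather): circulate the finished chunks around the ring.
    for step in range(n - 1):
        sends = [(i, (i + 1 - step) % n, data[i][(i + 1 - step) % n])
                 for i in range(n)]
        for i, chunk, value in sends:
            data[(i + 1) % n][chunk] = value

    return data


if __name__ == "__main__":
    print(ring_allreduce([[1, 2, 3], [10, 20, 30], [100, 200, 300]]))
```

Each node sends one chunk per step for 2(n - 1) steps, so the data sent per node stays close to twice the vector size regardless of how many nodes join the ring.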
