Artificial Intelligence Zone

AI Learns from AI: The Emergence of Social Learning Among Large Language Models

Unite.AI

MARCH 22, 2024

These LLMs, designed to process and generate human-like text, learn from an extensive array of texts from the internet, ranging from books to websites. This learning process allows them to capture the essence of human language making them general purpose problem solvers. What's Social Learning? Social learning isn't a new idea.

Large Language Models

Large Language Models AI AI Artificial Intelligence

Reinforcement Learning: Training AI Agents Through Rewards and Penalties

Marktechpost

MAY 7, 2024

Reinforcement learning (RL) is a fascinating field of AI focused on training agents to make decisions by interacting with an environment and learning from rewards and penalties. RL differs from supervised learning because it involves doing rather than learning from a static dataset.

Robotics

Robotics Algorithm AI AI

Google AI Proposes PERL: A Parameter Efficient Reinforcement Learning Technique that can Train a Reward Model and RL Tune a Language Model Policy with LoRA

Marktechpost

MARCH 22, 2024

Reinforcement Learning from Human Feedback (RLHF) enhances the alignment of Pretrained Large Language Models (LLMs) with human values, improving their applicability and reliability. However, aligning LLMs through RLHF faces significant hurdles, primarily due to the process’s computational intensity and resource demands.

Large Language Models

Large Language Models Artificial Intelligence Artificial Intelligence AI

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Leading the Development of Profitable and Sustainable Products

How To Get Promoted In Product Management

MORE WEBINARS

Emerging Trends in Reinforcement Learning: Applications Beyond Gaming

Marktechpost

APRIL 16, 2024

Reinforcement Learning (RL) is expanding its footprint, finding innovative uses across various industries far beyond its origins in gaming. Algorithmic Trading: Executing high-speed trades based on learned strategies from vast market data. Smart Cities In urban planning, RL is used to optimize traffic management systems.

Robotics

Robotics Continuous Learning Algorithm Automation

Unlearning Copyrighted Data From a Trained LLM – Is It Possible?

Unite.AI

JANUARY 23, 2024

In the domains of artificial intelligence (AI) and machine learning (ML), large language models (LLMs) showcase both achievements and challenges. These techniques are resource-intensive and time-consuming, making them difficult to implement. Trained on vast textual datasets, LLM models encapsulate human language and knowledge.

LLM

LLM Large Language Models OpenAI ML

Improve LLM performance with human and AI feedback on Amazon SageMaker for Amazon Engineering

AWS Machine Learning Blog

APRIL 24, 2024

In this post, we share how we analyzed the feedback data and identified limitations of accuracy and hallucinations RAG provided, and used the human evaluation score to train the model through reinforcement learning. To increase training samples for better learning, we also used another LLM to generate feedback scores.

LLM

LLM AI AI Generative AI

UC Berkeley Researchers Introduce SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Marktechpost

FEBRUARY 7, 2024

In recent years, researchers in the field of robotic reinforcement learning (RL) have achieved significant progress, developing methods capable of handling complex image observations, training in real-world scenarios, and incorporating auxiliary data, such as demonstrations and prior experience.

Robotics

Robotics Algorithm Automation ML

Next-Gen AI: OpenAI and Meta’s Leap Towards Reasoning Machines

Unite.AI

APRIL 19, 2024

This progress stems from a generative AI training strategy where models learn to predict missing words and pixels. Furthermore, the quest for AGI seeks to develop AI systems that match the learning efficiency, adaptability, and application capabilities observed in humans and animals.

OpenAI

OpenAI AI AI Artificial Intelligence

How IBM invests in opportunities to reach maximum equitable impact

IBM Journey to AI blog

APRIL 11, 2023

Today, I want to share some examples that highlight IBM’s 2022 progress in the area of social impact. That is why we take our commitment to providing career training, learning opportunities and career growth very seriously. In 2022, we increased the hours of learning per employee.

ESG

3 myths hindering your business from adopting generative AI

IBM Journey to AI blog

DECEMBER 4, 2023

Business leaders overestimate the resources required to pursue AI, often unaware of the available technology and training. We also provide the same contractual intellectual property protections  for IBM-developed AI models as we do for all our products, reinforcing trust in businesses’ AI journeys.

Generative AI

Generative AI AI AI AI Chatbots

Kamal Ahluwalia, Ikigai Labs: How to take your business to the next level with generative AI

AI News

APRIL 17, 2024

While some companies have gone the route of building their own tech stack so LLMs can be used in a safe environment, most organisations lack the talent and resources to build it themselves. Especially in regulated industries, human oversight, validation, and reinforcement learning are necessary.

Generative AI

Generative AI Big Data AI AI

Supercharging Graph Neural Networks with Large Language Models: The Ultimate Guide

Unite.AI

MAY 8, 2024

Graph Neural Networks (GNNs) have emerged as a powerful deep learning framework for graph machine learning tasks. The tremendous success of LLMs has catalyzed explorations into leveraging their power for graph machine learning tasks. Some examples include GraphormerTransformer , and GraphFormers.

Neural Network

Neural Network Large Language Models LLM BERT

How RLHF Preference Model Tuning Works (And How Things May Go Wrong)

AssemblyAI

AUGUST 3, 2023

One such method, Reinforcement Learning from Human Feedback (RLHF) , is currently leading the charge. The first step is to create a dataset of examples reflecting human preferences. Once the reward model is ready, it can be used to fine-tune the base model via Reinforcement Learning.

LLM

LLM Chatbots ChatGPT OpenAI

NVIDIA Isaac Taps Generative AI for Manufacturing and Logistics Applications

NVIDIA

MARCH 18, 2024

On stage before a crowd of 10,000-plus, NVIDIA founder and CEO Jensen Huang demonstrated Project GR00T , which stands for Generalist Robot 00 Technology, a general-purpose foundation model for humanoid robot learning. Isaac Lab is an open-source, performance-optimized application for robot learning built on the Isaac Sim platform.

Robotics

Robotics Generative AI Machine Learning AI

This AI Paper Introduces the ‘ForgetFilter’: A Machine Learning Algorithm that Filters Unsafe Data based on How Strong the Model’s Forgetting Signal is for that Data

Marktechpost

DECEMBER 24, 2023

This can be achieved through reinforcement learning from human feedback (RLHF) or traditional supervised learning. Its novel approach involves strategically filtering unsafe examples from noisy downstream data, mitigating the risks associated with biased or harmful model outputs.

Machine Learning

Machine Learning Large Language Models Algorithm LLM

Optimize equipment performance with historical data, Ray, and Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 7, 2023

Offline reinforcement learning is a control strategy that allows industrial companies to build control policies entirely from historical data without the need for an explicit process model. To learn more about reinforcement learning, see Use Reinforcement Learning with Amazon SageMaker.

Natural Language Processing

Natural Language Processing Machine Learning Algorithm Automation

The Rise of Generative AI: From Art to Content Creation

Marktechpost

APRIL 16, 2024

This innovative technology utilizes machine learning algorithms to produce content autonomously, ranging from images and music to text and videos. Example: AI artist Robbie Barrat utilizes GANs to produce artworks that challenge traditional concepts of authorship and artistic merit. and Jasper.ai ” Netflix.

Generative AI

Generative AI AI AI Algorithm

The Unsung Hero of Machine Learning — Linear Algebra

Towards AI

JANUARY 28, 2024

Image by [link] Machine learning, data mining, deep learning, and advanced optimization algorithms all rely heavily on linear algebra. Linear algebra is widely used in almost all machine learning algorithms. For example, Gaussian elimination aids in the solution of normal equations derived from the least squares method.

Machine Learning

Machine Learning Deep Learning Algorithm Data Mining

The New 3-Legged Stool of Sustainable Innovation: Data, AI, & Human Creativity

Unite.AI

FEBRUARY 5, 2024

They’re evaluating and testing the efficiencies and speed enabled by consuming existing AI services, and then developing capabilities to create competitive advantage – for example, by tuning models and training them to use their own proprietary data.”

AI

AI AI Automation Artificial Intelligence

Patrick M. Pilarski, Ph.D. Canada CIFAR AI Chair (Amii) – Interview Series

Unite.AI

MAY 30, 2023

Intelligence is just one of these amazing examples of that, so whether it's coming from biology or whether it's coming from how we see elaborate behavior emerge in machines, I think there's something beautiful about that. Could you talk about the machine learning behind this? The underlying technologies support continual learning.

Machine Learning

Machine Learning AI AI Robotics

Automated Prompt Engineering: Leveraging Synthetic Data and Meta-Prompts for Enhanced LLM Performance

Marktechpost

MARCH 4, 2024

Recent studies propose using meta-prompts that learn from past trials to suggest improved prompts automatically. A team of researchers has devised Intent-based Prompt Calibration (IPC), a system to fine-tune prompts based on user intention using synthetic examples. Users may provide examples in a few-shot setting.

Prompt Engineer

Prompt Engineer Prompt Engineering LLM Automation

Data is essential: Building an effective generative AI marketing strategy

IBM Journey to AI blog

SEPTEMBER 6, 2023

For example, a retail clothing company might use generative AI to customize email or online experiences tailored for different customer personas. 1 When asked about their biggest concerns regarding generative AI, leaders were focused on data accuracy, privacy management and having the skilled resources to build this solution.

Generative AI

Generative AI AI AI AI Tools

This NIST Trustworthy and Responsible AI Report Develops a Taxonomy of Concepts and Defines Terminology in the Field of Adversarial Machine Learning (AML)

Marktechpost

JANUARY 17, 2024

The well-known Large Language Models (LLMs), which have recently gathered massive attention, are the best examples of generative AI. The goal is to provide a thorough resource that helps shape future practice guides and standards for evaluating and controlling the security of AI systems.

Machine Learning

Machine Learning Responsible AI Artificial Intelligence Artificial Intelligence

The most valuable AI use cases for business

IBM Journey to AI blog

FEBRUARY 14, 2024

Using machine learning (ML), AI can understand what customers are saying as well as their tone—and can direct them to customer service agents when needed. For example, Amazon reminds customers to reorder their most often-purchased products, and shows them related products or suggestions.

Computer Vision

Computer Vision Automation Robotics AI

Conversation AI: What it is and top use cases

AssemblyAI

AUGUST 29, 2023

Examples of Conversation AI include a chatbot on a website or a virtual assistant on a help page. Other companies across industries integrate Conversation AI tools to increase resource and cost efficiency, increase sales and customer engagement and satisfaction, and to more easily scale services. What is Conversation AI?

Conversational AI

Conversational AI AI Tools Chatbots AI

How ROBOSHOT boosts zero-shot foundation model performance

Snorkel AI

APRIL 30, 2024

For example, in classifying water birds versus land birds, if the pre-training dataset often shows water birds in front of water and land birds on land, the model may mistakenly use the background as the basis for its prediction. This process, while effective, requires substantial human and computational resources. A promising start.

LLM

LLM BERT Large Language Models Machine Learning

4 Ways SMEs Can Use Technology to Upskill Their Workforce

Aiiot Talk

JUNE 14, 2023

Small and medium-sized enterprises (SMEs) often need more resources to deploy technology in the large-scale ways bigger companies can. Plus, 47% of companies using VR said it helped improve workers’ understanding of what they learned. For example, 99% of non-profits understand the necessity of donor information security.

Robotics

Robotics Artificial Intelligence Artificial Intelligence AI

The Multimodal Marvel: Exploring GPT-4o’s Cutting-Edge Capabilities

Unite.AI

MAY 15, 2024

From the early days of rule-based systems to the advent of machine learning and deep learning , AI has evolved to become more advanced and versatile. For example, one can now take a picture of a menu in a different language and ask GPT-4o to translate it or learn about the food.

Neural Network

Neural Network OpenAI Software Development AI Modeling

Cam Linke, CEO at Alberta Machine Intelligence Institute (Amii) – Interview Series

Unite.AI

JUNE 26, 2023

His research, which focuses on AI adapting behaviors' to improve their own self-learning, has been published at top conferences. To be able to be around each other, learn, and grow from each other. Why the heck is Edmonton one of the places leading the world in this AI and machine learning thing? Amii was founded in 2002.

Machine Learning

Machine Learning Artificial Intelligence Artificial Intelligence Large Language Models

How ROBOSHOT boosts zero-shot foundation model performance

Snorkel AI

APRIL 30, 2024

For example, in classifying water birds versus land birds, if the pre-training dataset often shows water birds in front of water and land birds on land, the model may mistakenly use the background as the basis for its prediction. This process, while effective, requires substantial human and computational resources. A promising start.

LLM

LLM BERT Large Language Models Machine Learning

Improving your LLMs with RLHF on Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 22, 2023

Reinforcement Learning from Human Feedback (RLHF) is recognized as the industry standard technique for ensuring large language models (LLMs) produce content that is truthful, harmless, and helpful. Reward models and reinforcement learning are applied iteratively with human-in-the-loop feedback.

Machine Learning

Machine Learning LLM Computer Vision Large Language Models

Meta-Learning: Learning to Learn in Machine Learning

Heartbeat

JANUARY 29, 2024

Photo by Brett Jordan on Unsplash In the ever-evolving landscape of artificial intelligence and machine learning, researchers and practitioners continuously seek to elevate the capabilities of intelligent systems. Among the myriad breakthroughs in this field, Meta-Learning is pushing the boundaries of machine learning.

Machine Learning

Machine Learning Neural Network Natural Language Processing Algorithm

This AI Paper from UCLA Introduces ‘SPIN’ (Self-Play fIne-tuNing): A Machine Learning Method to Convert a Weak LLM to a Strong LLM by Unleashing the Full Power of Human-Annotated Data

Marktechpost

JANUARY 5, 2024

To align the performance of such models with desirable behavior, they are fine-tuned using techniques like Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). The authors demonstrated the effectiveness of SPIN through an example. If you like our work, you will love our newsletter.

LLM

LLM Machine Learning Natural Language Processing Large Language Models

Researchers from Harvard Introduce Inference-Time Intervention (ITI): An AI Technique that Improves the Truthfulness of Language Models from 32.5% to 65.1%

Marktechpost

JUNE 14, 2023

ITI differs from currently used techniques like RLHF (Reinforcement Learning from Human Feedback), which depend on modifying pretrained language models with reinforcement learning and require a lot of computation and annotation resources. The team has shared an example of a comparison between LLaMA and ITI.

Large Language Models

Large Language Models BERT Artificial Intelligence Artificial Intelligence

Stanford Researchers Introduce SequenceMatch: Training LLMs With An Imitation Learning Loss

Marktechpost

JUNE 26, 2023

One of the most well-known examples of autoregressive models is the class of GPT models, especially GPT-3 and its variants, which are largely based on the foundation of predicting the next word in a sequence given the previous words. In other words, the model predicts the future value of a variable by regressing it on its past values.

AI Tools

AI Tools AI Researcher AI Research ML

Generative vs Predictive AI: Key Differences & Real-World Applications

Topbots

OCTOBER 4, 2023

In this article, we will review the key machine-learning techniques driving these two major classes of AI approaches, the unique benefits and challenges associated with them, and their respective real-world business applications. It does this by learning from existing data and then generating new data that is similar to the training data.

Generative AI

Generative AI Natural Language Processing Machine Learning Convolutional Neural Networks

Pacific Northwest National Lab’s chief scientist for AI finds links between tech and national security

Flipboard

JANUARY 3, 2024

We can better prioritize infrastructure, prioritize resources, prioritize tooling, upskilling, training and so forth.” We have other efforts that we are just starting with the Department of Energy — for example, to accelerate permitting processes. “It makes sense to bring everybody together in a virtual research hub.

AI

AI AI Deep Learning Artificial Intelligence

Google at ICLR 2023

Google Research AI blog

APRIL 30, 2023

Posted by Catherine Armato, Program Manager, Google The Eleventh International Conference on Learning Representations (ICLR 2023) is being held this week as a hybrid event in Kigali, Rwanda. We are proud to be a Diamond Sponsor of ICLR 2023, a premier conference on deep learning, where Google researchers contribute at all levels.

Neural Network

Neural Network Large Language Models Machine Learning Deep Learning

Chatbot Development Using Reinforcement Learning and NLP Techniques

Heartbeat

JULY 5, 2023

In this article, you will learn how to use RL and NLP to create an entire chatbot system. What is Reinforcement? Reinforcement learning is a subfield of machine learning (ML) that teaches an agent to learns how to act in a particular setting in order to maximize the reward signal there. Why is NLP Required?

NLP

NLP Chatbots Natural Language Processing Deep Learning

Generative Adversarial Networks (GANs) vs. Deep Reinforcement Learning (DRL)

Heartbeat

MAY 30, 2023

Photo by Othmar Vigl on Pexels Introduction Generative Adversarial Networks (GANs) and Deep Reinforcement Learning (DRL) are two popular and continuously developing artificial intelligence subfields that have gotten a lot of interest and research in recent years. What is Deep Reinforcement Learning (DRL)?

Neural Network

Neural Network Convolutional Neural Networks Deep Learning Machine Learning

Establishing an AI/ML center of excellence

AWS Machine Learning Blog

MAY 9, 2024

The rapid advancements in artificial intelligence and machine learning (AI/ML) have made these technologies a transformative force across industries. According to a McKinsey study , across the financial services industry (FSI), generative AI is projected to deliver over $400 billion (5%) of industry revenue in productivity benefits.

ML

ML Generative AI AI AI

Machine Learning Engineering in the Real World

ODSC - Open Data Science

SEPTEMBER 21, 2023

The majority of us who work in machine learning, analytics, and related disciplines do so for organizations with a variety of different structures and motives. In pretty much all of these cases, we do not do this work in a vacuum and not with an infinite budget of time or resources.

Machine Learning

Machine Learning ML Engineer ML Data Science

Can ChatGPT Compete with Domain-Specific Sentiment Analysis Machine Learning Models?

Topbots

JUNE 22, 2023

ChatGPT is a GPT ( G enerative P re-trained T ransformer) machine learning (ML) tool that has surprised the world. in “Assimilating sentiment analysis in reinforcement learning for intelligent trading”). For this code example, consider SemEval’s 2017 Task gold-standard dataset that you can get here.

Machine Learning

Machine Learning ChatGPT Natural Language Processing Categorization

Machine Learning vs. Deep Learning - A Comparison

Heartbeat

OCTOBER 11, 2023

This process is known as machine learning or deep learning. Two of the most well-known subfields of AI are machine learning and deep learning. What is Machine Learning? Machine learning algorithms can make predictions or classifications based on input data.

Deep Learning

Deep Learning Machine Learning Neural Network Natural Language Processing

AI Learns from AI: The Emergence of Social Learning Among Large Language Models

Reinforcement Learning: Training AI Agents Through Rewards and Penalties

Webinars

Trending Sources

Google AI Proposes PERL: A Parameter Efficient Reinforcement Learning Technique that can Train a Reward Model and RL Tune a Language Model Policy with LoRA

Webinars

Emerging Trends in Reinforcement Learning: Applications Beyond Gaming

Unlearning Copyrighted Data From a Trained LLM – Is It Possible?

Improve LLM performance with human and AI feedback on Amazon SageMaker for Amazon Engineering

UC Berkeley Researchers Introduce SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Next-Gen AI: OpenAI and Meta’s Leap Towards Reasoning Machines

How IBM invests in opportunities to reach maximum equitable impact

3 myths hindering your business from adopting generative AI

Kamal Ahluwalia, Ikigai Labs: How to take your business to the next level with generative AI

Supercharging Graph Neural Networks with Large Language Models: The Ultimate Guide

How RLHF Preference Model Tuning Works (And How Things May Go Wrong)

NVIDIA Isaac Taps Generative AI for Manufacturing and Logistics Applications

This AI Paper Introduces the ‘ForgetFilter’: A Machine Learning Algorithm that Filters Unsafe Data based on How Strong the Model’s Forgetting Signal is for that Data

Optimize equipment performance with historical data, Ray, and Amazon SageMaker

The Rise of Generative AI: From Art to Content Creation

The Unsung Hero of Machine Learning — Linear Algebra

The New 3-Legged Stool of Sustainable Innovation: Data, AI, & Human Creativity

Patrick M. Pilarski, Ph.D. Canada CIFAR AI Chair (Amii) – Interview Series

Automated Prompt Engineering: Leveraging Synthetic Data and Meta-Prompts for Enhanced LLM Performance

Data is essential: Building an effective generative AI marketing strategy

This NIST Trustworthy and Responsible AI Report Develops a Taxonomy of Concepts and Defines Terminology in the Field of Adversarial Machine Learning (AML)

The most valuable AI use cases for business

Conversation AI: What it is and top use cases

How ROBOSHOT boosts zero-shot foundation model performance

4 Ways SMEs Can Use Technology to Upskill Their Workforce

The Multimodal Marvel: Exploring GPT-4o’s Cutting-Edge Capabilities

Cam Linke, CEO at Alberta Machine Intelligence Institute (Amii) – Interview Series

How ROBOSHOT boosts zero-shot foundation model performance

Improving your LLMs with RLHF on Amazon SageMaker

Meta-Learning: Learning to Learn in Machine Learning

This AI Paper from UCLA Introduces ‘SPIN’ (Self-Play fIne-tuNing): A Machine Learning Method to Convert a Weak LLM to a Strong LLM by Unleashing the Full Power of Human-Annotated Data

Researchers from Harvard Introduce Inference-Time Intervention (ITI): An AI Technique that Improves the Truthfulness of Language Models from 32.5% to 65.1%

Stanford Researchers Introduce SequenceMatch: Training LLMs With An Imitation Learning Loss

Generative vs Predictive AI: Key Differences & Real-World Applications

Pacific Northwest National Lab’s chief scientist for AI finds links between tech and national security

Google at ICLR 2023

Chatbot Development Using Reinforcement Learning and NLP Techniques

Generative Adversarial Networks (GANs) vs. Deep Reinforcement Learning (DRL)

Establishing an AI/ML center of excellence

Machine Learning Engineering in the Real World

Can ChatGPT Compete with Domain-Specific Sentiment Analysis Machine Learning Models?

Machine Learning vs. Deep Learning - A Comparison

Stay Connected