Blog - Artificial Intelligence Zone

Open source large language models: Benefits, risks and types

IBM Journey to AI blog

SEPTEMBER 27, 2023

An open source LLM offers transparency regarding how it works, its architecture, its training data and methodologies, and how it’s used. Added features and community contributions: pre-trained, open source LLMs allow fine-tuning. All this reduces the risk of a data leak or unauthorized access.

Large Language Models, LLM, Explainability, Chatbots

The most important AI trends in 2024

IBM Journey to AI blog

FEBRUARY 9, 2024

Enhanced with fine-tuning techniques and datasets developed by the open source community, many open models can now outperform all but the most powerful closed-source models on most benchmarks, despite far smaller parameter counts. Sam Altman, CEO of OpenAI (whose GPT-4 model is rumored to have around 1.76 trillion parameters)

AI, Generative AI, Artificial Intelligence

Llama 2. A significant milestone in the world of AI

deepsense.ai

NOVEMBER 30, 2023

In this blog post, we will focus on the widely discussed Llama 2 model. While the first iteration of Llama (presented in late February 2023) was made available for non-commercial use, the second version, Llama 2, takes a leap forward by not only being open to the public but also being available for commercial use.

Artificial Intelligence, AI

Reducing the cost of LLMs with quantization and efficient fine-tuning: how can businesses benefit from Generative AI with limited hardware?

deepsense.ai

FEBRUARY 28, 2024

The wide adoption of ChatGPT and other large language models (LLMs) among individuals made companies of all sizes and across all sectors of industry wonder how they could benefit from this upward-trending technology. The two main topics we will dive into are quantized inference and parameter-efficient fine-tuning.

Generative AI, LLM, Neural Network, Algorithm

Large Language Models for Product Managers: 5 Things to Know

AssemblyAI

MAY 23, 2023

With these complex algorithms often labeled as "giant black boxes" in media, there's a growing need for accurate and easy-to-understand resources, especially for Product Managers wondering how to incorporate AI into their product roadmap. During training, text sequences are extracted from the corpus and truncated.

Large Language Models, Neural Network, LLM, Chatbots

The Top Large Language Models Going Into 2024

ODSC - Open Data Science

JANUARY 4, 2024

In this blog, we’re going to explore the top LLMs of 2023 and maybe find out why they’re popular. Over the last year, the GPT model has gotten even bigger, and more powerful and creative users have taken advantage of its robust dataset to make incredible things. It’s a massive model with over 33 billion parameters.

Large Language Models, BERT, Natural Language Processing, Data Science

What Are ChatGPT and Its Friends?

Flipboard

MARCH 23, 2023

Maybe it’s surprising that ChatGPT can write software, maybe it isn’t; we’ve had over a year to get used to GitHub Copilot, which was based on an earlier version of GPT. It’s a convenient user interface built around one specific language model, GPT-3.5, which has received some specialized training.

ChatGPT, Large Language Models, OpenAI, Explainability

The Ascent of ChatGPT

ODSC - Open Data Science

FEBRUARY 14, 2023

It is the latest in the research lab’s lineage of large language models using Generative Pre-trained Transformer (GPT) technology. Trained with 570 GB of data from books and all the written text on the internet, ChatGPT is an impressive example of the training that goes into the creation of conversational AI.

ChatGPT, Large Language Models, OpenAI, Conversational AI

Financial text generation using a domain-adapted fine-tuned large language model in Amazon SageMaker JumpStart

AWS Machine Learning Blog

APRIL 18, 2023

Large language models (LLMs) with billions of parameters are currently at the forefront of natural language processing (NLP). In order to train an LLM to become an expert in a particular domain, fine-tuning is usually required.

Large Language Models, LLM, NLP, Deep Learning

Exploring the Power of LLama 2 Using Streamlit

Heartbeat

JANUARY 18, 2024

Many advancements have been made since ChatGPT, including open-source and licensed models. An open-source model that is just as good as GPT-3.5 or even GPT-4? Replicate is a cloud platform that hosts large machine learning models for easy deployment. Compared to the popular closed-source model, GPT-3.5,

LLM, Chatbots, Large Language Models, Python

Domain-adaptation Fine-tuning of Foundation Models in Amazon SageMaker JumpStart on Financial data

AWS Machine Learning Blog

APRIL 18, 2023

Large language models (LLMs) with billions of parameters are currently at the forefront of natural language processing (NLP). In order to train an LLM to become an expert in a particular domain, fine-tuning is usually required.

LLM, NLP, Deep Learning, ML

How Patsnap used GPT-2 inference on Amazon SageMaker with low latency and cost

AWS Machine Learning Blog

JULY 24, 2023

This blog post was co-authored, and includes an introduction, by Zilong Bai, senior natural language processing engineer at Patsnap. They use big data (such as a history of past search queries) to provide many powerful yet easy-to-use patent tools. Patsnap had trained a customized GPT-2 model for such a purpose.

Metadata, Generative AI, Natural Language Processing, Deep Learning

Levanter: A New Jax Framework for LLM

Bugra Akyildiz

JUNE 18, 2023

Stanford wrote a blog post about Levanter, a new JAX-based codebase for training foundation models. Scalable: Levanter is designed to scale to large models and to train on a variety of hardware, including GPUs and TPUs. This model is 4 points behind LLaMA-7B, and 1.3

LLM, Large Language Models, Deep Learning, ETL

OpenAI announces ChatGPT

Bugra Akyildiz

DECEMBER 3, 2022

I am sure this is a relatively cherry-picked example, but it shows the success of the training methodology and of reinforcement learning on a very large language model well. The model is often excessively verbose and overuses certain phrases, such as restating that it’s a language model trained by OpenAI.

OpenAI, ChatGPT, Data Drift, Robotics

AWS performs fine-tuning on a Large Language Model (LLM) to classify toxic speech for a large gaming company

AWS Machine Learning Blog

AUGUST 7, 2023

The video gaming industry has an estimated user base of over 3 billion worldwide. It will explain the thought process and experimentation behind the solution, including the model training and development process. The customer had both cost and time constraints that made this solution unviable.

Large Language Models, LLM, BERT, ML

ML Pipeline Architecture Design Patterns (With 10 Real-World Examples)

The MLOps Blog

AUGUST 11, 2023

There comes a time when every ML practitioner realizes that training a model in Jupyter Notebook is just one small part of the entire project. How should they be implemented to accommodate scalability and adaptability whilst maintaining an infrastructure that’s easy to troubleshoot? 1 Data Ingestion (e.g.,

ML, Machine Learning, Data Ingestion, Deep Learning

Accelerate your learning towards AWS Certification exams with automated quiz generation using Amazon SageMaker foundations models

AWS Machine Learning Blog

MAY 31, 2023

In 2018, BERT-large made its debut with its 340 million parameters and innovative transformer architecture, setting the benchmark for performance on NLP tasks. Recent advances in ML have given rise to a new class of models known as foundation models, which have billions of parameters and are trained on massive amounts of data.

Automation, Python, Prompt Engineer, Prompt Engineering

Distributed Training: Errors to Avoid

The MLOps Blog

FEBRUARY 28, 2023

In this era of large language models (LLMs), monolithic foundation models, and increasingly enormous datasets, distributed training is a must, as both data and model weights very rarely fit on a single machine. This article will touch on ten of the most common errors in distributed model training and will suggest solutions to each of them.

Metadata, Algorithm, Large Language Models, Deep Learning
