article thumbnail

All You Need to Know About Gemma, the Open-Source LLM Powerhouse

Analytics Vidhya

Google has been a frontrunner in AI research, contributing significantly to the open-source community with transformative technologies like TensorFlow, BERT, T5, JAX, AlphaFold, and AlphaCode. What is Gemma LLM?

LLM 246
article thumbnail

Mistral AI unveils LLM rivalling major players

AI News

Mistral AI, a France-based startup, has introduced a new large language model (LLM) called Mistral Large that it claims can compete with several top AI systems on the market. Mistral AI stated that Mistral Large outscored most major LLMs except for OpenAI’s recently launched GPT-4 in tests of language understanding.

LLM 243
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Snowflake AI Research Team Unveils Arctic: An Open-Source Enterprise-Grade Large Language Model (LLM) with a Staggering 480B Parameters

Marktechpost

Snowflake AI Research has launched the Arctic , a cutting-edge open-source large language model (LLM) specifically designed for enterprise AI applications, setting a new standard for cost-effectiveness and accessibility.

article thumbnail

AgentLite by Salesforce AI Research: Transforming LLM Agent Development with an Open-Source, Lightweight, Task-Oriented Library for Enhanced Innovation

Marktechpost

However, complexities are involved in developing and evaluating new reasoning strategies and agent architectures for LLM agents due to the intricacy of existing frameworks. A research team from Salesforce AI Research presents AgentLite , an open-source AI Agent library that simplifies the design and deployment of LLM agents.

LLM 126
article thumbnail

Amazon is building a LLM to rival OpenAI and Google

AI News

Amazon is reportedly making substantial investments in the development of a large language model (LLM) named Olympus. Training such massive AI models is a costly endeavour, primarily due to the significant computing power required. The post Amazon is building a LLM to rival OpenAI and Google appeared first on AI News.

LLM 232
article thumbnail

Microsoft AI Research Unveils DeepSpeed-FastGen: Elevating LLM Serving Efficiency with Innovative Dynamic SplitFuse Technique

Marktechpost

Traditional approaches to LLM serving, while adept at training models effectively, falter during inference, especially in tasks like open-ended text generation. vLLM, powered by PagedAttention, and research systems like Orca have improved LLM inference performance. lower tail latency compared to vLLM. Check out the Paper.

LLM 104
article thumbnail

This AI Research Introduces Fast and Expressive LLM Inference with RadixAttention and SGLang

Marktechpost

Frontend: Easy LLM Programming with SGLang The team also presents SGLang, an embedded domain-specific language in Python, on the front end. The researchers recorded the throughput their system attained when testing it on the following typical LLM workloads: MMLU: A multi-tasking, 5-shot, multiple-choice test. advice v0.1.8,

LLM 86