article thumbnail

Modular nabs $100M for its AI programming language and inference engine - SiliconANGLE

Flipboard

Modular Inc., the creator of a programming language optimized for developing artificial intelligence software, has raised $100 million in fresh funding.General Catalyst led the investment, which w

article thumbnail

Together AI Unveils Revolutionary Inference Stack: Setting New Standards in Generative AI Performance

Marktechpost

The Together Inference Engine, capable of processing over 400 tokens per second on Meta Llama 3 8B, integrates the latest innovations from Together AI, including FlashAttention-3, faster GEMM and MHA kernels, and quality-preserving quantization, as well as speculative decoding techniques.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Revolutionizing Fine-Tuned Small Language Model Deployments: Introducing Predibase’s Next-Gen Inference Engine

Marktechpost

Predibase announces the Predibase Inference Engine , their new infrastructure offering designed to be the best platform for serving fine-tuned small language models (SLMs). The Predibase Inference Engine addresses these challenges head-on, offering a tailor-made solution for enterprise AI deployments.

article thumbnail

OpenPerPlex: A New Open-Source AI Search Engine that Leverages Cutting-Edge Technologies to Provide Search Capabilities over the Web

Marktechpost

The integration with Google search through a specialized API enhances the breadth of information available, while a powerful inference engine ensures efficient processing. It also uses a reranking system to refine the results based on relevance. OpenPerPlex offers several features that highlight its capabilities.

article thumbnail

MEGA-Bench: A Comprehensive AI Benchmark that Scales Multimodal Evaluation to Over 500 Real-World Tasks at a Manageable Inference Cost

Marktechpost

Upcoming Live Webinar- Oct 29, 2024] The Best Platform for Serving Fine-Tuned Models: Predibase Inference Engine (Promoted) The post MEGA-Bench: A Comprehensive AI Benchmark that Scales Multimodal Evaluation to Over 500 Real-World Tasks at a Manageable Inference Cost appeared first on MarkTechPost.

article thumbnail

Rethinking Direct Alignment: Balancing Likelihood and Diversity for Better Model Performance

Marktechpost

Upcoming Live Webinar- Oct 29, 2024] The Best Platform for Serving Fine-Tuned Models: Predibase Inference Engine (Promoted) The post Rethinking Direct Alignment: Balancing Likelihood and Diversity for Better Model Performance appeared first on MarkTechPost. If you like our work, you will love our newsletter.

article thumbnail

Katanemo Open Sources Arch-Function: A Set of Large Language Models (LLMs) Promising Ultra-Fast Speeds at Function-Calling Tasks for Agentic Workflows

Marktechpost

Upcoming Live Webinar- Oct 29, 2024] The Best Platform for Serving Fine-Tuned Models: Predibase Inference Engine (Promoted) The post Katanemo Open Sources Arch-Function: A Set of Large Language Models (LLMs) Promising Ultra-Fast Speeds at Function-Calling Tasks for Agentic Workflows appeared first on MarkTechPost.