Open Data Lakes, Safeguarding Images From AI, Free Data Viz Tools, and 50% Off ODSC East

ODSC - Open Data Science
ODSCJournal
Published in
5 min readFeb 15, 2024

--

The Future of the Single Source of Truth is an Open Data Lake

Organizations that strive for high-performance data systems are increasingly turning towards the ELT (Extract, Load, Transform) model using an open data lake.

3 Tools to Safeguard Images From AI Scraping

Not everyone wants their images or art used for training generative AI models, so these 3 tools can help safeguard your images from use in AI scraping.

5 Free Data Visualization Tools to Showcase Your Data in 2024

These five free data visualization tools will help you make the most out of your data this year, leading to better decision-making for your team.

Transform Global Speech Into Local Language with TalkLocal

TalkLocal is a prototype solution, available as a Python library, and handles the majority of the tasks involved in delivering a solution capable of handling transcription, translation, and subtitle creation in real-time.

Building GeoAI with Lego-Like Simplicity Through Visual Programming

In this preview of an ODSC East session, the author discusses GeoAI, what it is, and how their session will teach you more about it.

Get hands-on, practical experience with cutting-edge, industry-revolutionizing tools from a wide range of disciplines like Generative AI, Machine Learning, LLMs, Data Visualization, LLMOps and MLOps, and much more. Register by Friday for 50% off!

Industry, Opinion, Career Advice

Wishful Thinking Won’t Cut It: A 5-Step Framework for Making AI’s Potential Tangible

A wish for AI without a plan is unlikely to come true. This article gets your AI dreams out of the clouds and onto solid ground with a practical 5-step plan to stop dreaming and start doing.

Prefect: Unveiling Interactive Workflows

Interactive workflows open up a ton of use cases: approval workflows; branching logic; AI/LLM inputs; continuous data streams; and more. To DIY you need to: host an API, build a UI, and run or rent a database. Instead, use Prefect where interactive workflows are now natively supported. Check out this human-in-the-loop tutorial to learn more about how you can build interactive workflows using Prefect.

Data Science & AI News

ODSC’s AI Weekly Recap: Week of February 9th

This week’s AI Weekly Recap is about the EU’s new AI proposal and venture capitalists showcasing how AI is a central issue in funding pitches.

New AI Regulatory Framework Gets Green Light at EU

Recently, representatives of the EU member states met and voted in favor of a new proposal in Brussels related to AI.

Venture Capitalists: AI Now Significant Part of Funding Pitches

Venture capitalists from Kleiner Perkins opened up to TechCrunch about how AI has become a central issue in funding pitches.

New Paper From Apple Hopes to Reduce Error Rates in Speech Recognition Systems

Apple published a paper on a model called Acoustic Model Fusion, which aims to reduce the error rates of speech recognition systems.

Google DeepMind Unveils New Approach to Meta-Learning

Google DeepMind has released a paper on a new method of training neural networks to learn new tasks with limited data.

ODSC Highlights

Announcing the First Speakers for the 2024 Data Engineering Summit

Featuring topics like data-centric AI, Apache tools, and more, these are the first sessions announced for the Data Engineering Summit this April.

Remembering the 2023 Data Engineering Summit in Videos

As we continue preparations for the Data Engineering Summit this April, we’d like to highlight a few of our favorite videos from last year’s event. See them here!

Up Your Machine Learning Game With These ODSC East 2024 Sessions

Covering topics like causal AI, feature stores, and real-time decision-making, there’s a lot to learn from these ODSC East machine learning sessions this April.

Weekly Recap Newsletter

Want to get a weekly digest of AI news from around the world every Friday? Sign up for our new newsletter here!

New Podcast Episode: Large Language Models: Strategies and Best Practices with Sinan Ozdemir

This wide-ranging discussion will take you through Sinan’s inspiration and motivations for writing his book on LLMs, as well as common questions and challenges that arise when working with LLMs. Finally, Sinan shares his take on what the future of LLMs might look like.

Spotify | SoundCloud | Apple

Video of the Week: Towards Explainable and Language-Agnostic LLMs
In this talk, Walid S. Sabas unveils a groundbreaking approach to overcoming the limitations of Large Language Models in natural language processing and artificial intelligence. Addressing the challenges of true language understanding and the unexplainable nature of LLMs due to their deep neural network architecture, Walid proposes a novel solution that integrates symbolic representations with the empirical power of LLMs through bottom-up reverse engineering of language.

Upcoming Webinars:

Interview “No-Code and Low-Code AI: The New Era of Inclusive Tech Development”

Fri, Feb 23, 2024 12:00 PM — 1:00 PM EST

In the upcoming interview on February 23rd, we will speak with Gwendolyn Denise Stripling, PhD, and Artificial Intelligence and Machine Learning Content Developer at Google Cloud and Michael Abel PhD, both authors of the book “Low-Code AI: A Practical Project-Driven Introduction to Machine Learning.” In this interview, we’ll discuss what low-code and no-code AI is, how these topics can make AI more inclusive, and some examples of how to make it work in the real world. We’ll also touch upon details of their book and what went into writing it.

Inference Benchmarking of Prominent Open-Source LLMs

Tue, Feb 27, 2024 12:00 PM — 1:00 PM EST

In the upcoming webinar, we delve into the inference benchmarking of prominent open-source Large Language Models such as the 13B and 70B Llama-2. We have used a diverse range of compute shapes available inOracle Cloud Infrastructure (OCI), like Intel, AMD, ARM CPUs, and NVIDIA GPUs.

--

--

ODSC - Open Data Science
ODSCJournal

Our passion is bringing thousands of the best and brightest data scientists together under one roof for an incredible learning and networking experience.