Data architecture strategy for data quality

IBM Journey to AI blog

The first generation of data architectures, represented by enterprise data warehouse and business intelligence platforms, was characterized by thousands of ETL jobs, tables, and reports that only a small group of specialized data engineers understood, which limited their positive impact on the business.

Comparing Tools For Data Processing Pipelines

The MLOps Blog

A typical data pipeline consists of the following steps, through which data passes before being consumed by a downstream process such as ML model training. Data Ingestion: collecting raw data from its origin and storing it, using batch, streaming, or event-driven architectures.
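
As a rough illustration of the ingestion step, the sketch below contrasts batch and streaming collection using only the Python standard library. The function names and sample records are hypothetical and do not come from the article or any particular tool.

```python
from typing import Iterable, Iterator, List


def ingest_batch(source: Iterable[dict]) -> List[dict]:
    # Batch: collect everything available from the origin, then store it at once.
    return list(source)


def ingest_stream(source: Iterable[dict]) -> Iterator[dict]:
    # Streaming: hand each record downstream as soon as it arrives.
    for record in source:
        yield record


# Hypothetical raw events standing in for data at the origin.
raw_events = [{"id": 1, "value": 3.5}, {"id": 2, "value": 4.0}]

batch = ingest_batch(raw_events)
streamed_ids = [rec["id"] for rec in ingest_stream(raw_events)]
```

The batch variant materializes the whole input before anything downstream runs, while the generator in the streaming variant lets a consumer process records one at a time.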

Drowning in Data? A Data Lake May Be Your Lifesaver

ODSC - Open Data Science

Arjuna Chala, associate vice president, HPCC Systems. For those not familiar with the HPCC Systems data lake platform, can you describe your organization and the development history behind HPCC Systems? They were interested in creating a data platform capable of managing a sizable number of datasets.

How to Build Machine Learning Systems With a Feature Store

The MLOps Blog

Keeping track of exactly how the incoming data (the feature pipeline’s input) has to be transformed, and ensuring that each model receives features exactly as it saw them during training, is one of the hardest parts of architecting ML systems. This is where feature stores come in. What is a feature store?
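
The consistency problem described above can be sketched with a toy example. Everything here is an assumption made for illustration: `ToyFeatureStore`, `FeatureView`, `trip_features`, and the feature names are hypothetical and are not the API of any real feature store. The idea shown is only that training and serving look up the same named, versioned transformation instead of each reimplementing it.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

# Hypothetical record and feature types, chosen only for illustration.
RawRecord = Dict[str, float]
Features = Dict[str, float]


@dataclass(frozen=True)
class FeatureView:
    """A named, versioned transformation shared by training and serving."""
    name: str
    version: int
    transform: Callable[[RawRecord], Features]


def trip_features(raw: RawRecord) -> Features:
    # Single definition of the transformation, used by every consumer.
    return {
        "speed_kmh": raw["distance_km"] / max(raw["duration_h"], 1e-9),
        "is_long_trip": 1.0 if raw["distance_km"] > 100.0 else 0.0,
    }


class ToyFeatureStore:
    """In-memory stand-in for a feature registry (an assumption, not a real API)."""

    def __init__(self) -> None:
        self._views: Dict[Tuple[str, int], FeatureView] = {}

    def register(self, view: FeatureView) -> None:
        self._views[(view.name, view.version)] = view

    def get_features(self, name: str, version: int, raw: RawRecord) -> Features:
        # Both the training job and the serving path call this same lookup.
        return self._views[(name, version)].transform(raw)


store = ToyFeatureStore()
store.register(FeatureView("trip", 1, trip_features))

raw = {"distance_km": 150.0, "duration_h": 2.0}
training_row = store.get_features("trip", 1, raw)  # offline, at training time
serving_row = store.get_features("trip", 1, raw)   # online, at inference time
```

Pinning the view by name and version means a model trained on version 1 keeps receiving version 1 features even if a later version changes the transformation.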