Data Integration
Data Integration | News, how-tos, features, reviews, and videos
Data should be a first-class citizen in the cloud
Without proper data governance, interoperability, and access control, enterprises have no hope of maximizing the business value of their data.
DataStax adds Schema GPT Translator to Apache Pulsar-based Astra Streaming
The new Schema GPT Translator is designed to free developers to focus on other aspects real-time data pipelines instead of coping with the time-consuming process of manually creating schema mappings.
Microsoft offers Azure ML data import CLI, SDK for Snowflake, other databases
The new integration, which is in public preview, is designed to bring data into Azure’s ML service from data repositories outside the Azure platform.
Can AI solve IT’s eternal data problem?
New data management and integration solutions featuring AI and machine learning signal that help is on the way to meet the ballooning enterprise data challenge.
Data is a stumbling block for most multicloud deployments
If multicloud is your preferred architecture, you better have a solid plan for interoperability, security, portability, and centralized management. Do you?
Why observability in dataops?
Because building reliable data pipelines is hard, and the first step to becoming a data-driven organization is trusting your data.
3 cloud architecture best practices for industry clouds
As enterprises seek industry clouds to boost their overall cloud ROI, they should architect with integration and security in mind and not limit their choices to one provider.
AWS Glue upgrades Spark engines, backs Ray framework
Serverless data integration service in the Amazon cloud also adds support for built-in Pandas APIs and the Apache Hudi, Apache Iceberg, and Delta Lake formats.
Snowflake taps Python to take on Teradata, Google BigQuery, and Amazon Redshift
Snowflake's updates include support for Python on the Snowpark application development system , data access capabilities, and external tables for on-premises storage.
Review: Redpanda gives Kafka a run for its money
The Kafka-compatible distributed event streaming platform excels in latency and performance and offers a glimpse into the future of streaming with inline WebAssembly transforms and more.
5 ways to improve on spreadsheets for business workflows
When data and workflows get complicated, these platform approaches work far more efficiently and reliably than a spreadsheet.
Use the cloud to strengthen your supply chain
Stressed supply chains cause revenue and productivity issues and disrupt our own lives. Whose supply chains are still working well, and what technology are they using?
Overcoming the IoT interoperability hurdle
As IoT extends into every facet of our lives, the big challenge will be delivering data solutions that are interoperable with legacy, current, and future systems.
Why you should buy, not build, a customer data platform
Creating a single source of truth for customer data and a 360-degree customer view is a complex undertaking. Buying a solution rather than building one streamlines the process.
What is streaming data? Event stream processing explained
Streaming data records are typically small, measured in mere kilobytes, but the stream often goes on and on without ever stopping.
Deep Dive
AI, machine learning, and deep learning deep dive
Download this 26-page in-depth guide to AI, machine learning, and deep learning for easy reading at your convenience
Deep Dive
Get started with Angular
A step-by-step guide to installing the tools, creating an application, and getting up to speed with Angular components, directives, services, and routers
Deep Dive
Python megaguide: The best frameworks and IDEs
Only on InfoWorld: A hands-on, in-depth look at 13 Python web frameworks and six Python development toolkits