Data Engineering
Data Engineering | News, how-tos, features, reviews, and videos
Preview: Google Cloud Dataplex wows
Google Cloud Dataplex is an amazingly complete system for turning raw data from silos into unified data products ready for analysis. And a bit overwhelming to learn.
6 ways to avoid and reduce data debt
Data debt can be just as bad as tech debt, causing security and trust problems if it isn’t addressed throughout the data pipeline.
Alteryx updates its Designer Cloud UI, adds data lakehouse support
The data engineering cloud platform, acquired as part of the Trifacta buyout early this year, has been updated to offer a more Alteryx-like UI experience.
Data lake upstart Upsolver takes aim at Databricks
The San Francisco-based startup has released a SQL-based, self-orchestrating data pipeline platform, claiming it will go toe-to-toe with Databricks’ Delta Live Tables.
MIT startup DataCebo offers tool to evaluate synthetic data
Synthetic Data Metrics is an open-source Python library for evaluating model-agnostic tabular data by pitting machine-generated data sets against real data sets.
What devops needs to know about data governance
Industry leaders agree that data governance belongs to everyone in IT. Managing the privacy, security, and reliability of data impacts all aspects of the business.
IBM acquires data observability firm Databand.ai
Databand’s data observability platform allows data engineers to tackle challenges associated with bad data at source.
Career roadmap: Data engineer
A combination of education, on-the-job training, and a certificate in data science paved the way from health sciences to data engineering.
Doing data warehousing the wrong way
If data pipelines and streams are the future, why are we still thinking of data as static?
Databricks targets data pipeline automation with Delta Live Tables
The company’s new ETL framework aims to cut down the time taken by data scientists and engineers setting up reliable data pipelines and managing infrastructure.
Career roadmap: Machine learning engineer
As organizations worldwide adopt machine learning across virtually every industry, the demand for machine learning engineers is on the rise.
My data killed my cloud project!
As we push more data to the cloud, avoidable mistakes are hampering migration. The biggest culprit: messy data with inadequate security and integration.
Deep Dive
AI, machine learning, and deep learning deep dive
Download this 26-page in-depth guide to AI, machine learning, and deep learning for easy reading at your convenience
Get started with Angular
A step-by-step guide to installing the tools, creating an application, and getting up to speed with Angular components, directives, services, and routers
Python megaguide: The best frameworks and IDEs
Only on InfoWorld: A hands-on, in-depth look at 13 Python web frameworks and six Python development toolkits
Quick guide: Digital transformation and the agile enterprise
Enterprise transformation is hard. But when you build a platform for continuous change, putting new ideas into production becomes a lot easier
Career hacks: Professional do’s and don’ts for developers
The hot skills to master, the secrets to breaking into management, the career mistakes to avoid -- here's how to refactor yourself as the developer every organization wants
Deep Dive how-to: Office 365 document sharing
Office 2016, OneDrive, and Office 365 together offer powerful document collaboration capabilities across Windows, MacOS, iOS, Android, and the web -- but only if you set them up and manage them properly. This how-to guide walks you...