Remove 2014 Remove Data Science Remove ETL
article thumbnail

Ask HN: Who wants to be hired? (July 2025)

Hacker News

I'm JD, a Software Engineer with experience touching many parts of the stack (frontend, backend, databases, data & ETL pipelines, you name it). With over 3 years of working with ETL pipelines and REST API integrations and development, I understand how to develop and maintain robust and scalable data systems.

Python 54
article thumbnail

The Full Stack Data Scientist Part 6: Automation with Airflow

Applied Data Science

Building end-to-end data science solutions means developing data collection, feature engineering, model building and model serving processes. Airflow is a Python based open source orchestration tool developed in-house by Airbnb in 2014 to help their internal workflows. It’s a lot of stuff to stay on top of, right?

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Big Data – Lambda or Kappa Architecture?

Data Science Blog

Kappa – Architecture Jay Kreps introduced the Kappa architecture in 2014 as an alternative to the Lambda architecture. It offers the advantage of having a single ETL platform to develop and maintain. It is well-suited for developing data systems that emphasize online learning and do not require a separate batch layer.

Big Data 130
article thumbnail

What is the Snowflake Data Cloud and How Much Does it Cost?

phData

Effectively this is a way to store the source of truth and build (or rebuild) your downstream data products (including data warehouses) from it. What is the Difference Between a Data Lake and a Data Warehouse? Historically, there were big differences.

article thumbnail

7 Best Machine Learning Workflow and Pipeline Orchestration Tools 2024

DagsHub

Image generated with Midjourney In today’s fast-paced world of data science, building impactful machine learning models relies on much more than selecting the best algorithm for the job. Data scientists and machine learning engineers need to collaborate to make sure that together with the model, they develop robust data pipelines.

article thumbnail

How to Use Exploratory Notebooks [Best Practices]

The MLOps Blog

Jupyter notebooks have been one of the most controversial tools in the data science community. Nevertheless, many data scientists will agree that they can be really valuable – if used well. When data science was sexy , notebooks weren’t a thing yet. In 2014, Project Jupyter evolved from IPython.

SQL 52