2017 and Python - Data Science Current

Data lakehouse

Dataconomy

JUNE 18, 2025

Emergence of the term “data lakehouse” The term “data lakehouse” first appeared in documentation around 2017, with significant attention drawn by Databricks in 2020. Programming language support: Compatibility with programming languages like Python, Scala, and other APIs.

Data Lakes

Data Lakes Data Warehouse Business Intelligence Business Intelligence

Llama 4 family of models from Meta are now available in SageMaker JumpStart

AWS Machine Learning Blog

APRIL 7, 2025

Discover Llama 4 models in SageMaker JumpStart SageMaker JumpStart provides FMs through two primary interfaces: SageMaker Studio and the Amazon SageMaker Python SDK. Alternatively, you can use the SageMaker Python SDK to programmatically access and use SageMaker JumpStart models. billion in 2017 to a projected $37.68

AWS

AWS Machine Learning Machine Learning ML

Evaluating Long-Context Question & Answer Systems

Eugene Yan

JUNE 21, 2025

in 2017 , is designed to test genuine narrative comprehension rather than surface-level pattern matching. To build L-Eval, the authors first created four new datasets: Coursera (educational content), SFiction (science fiction stories), CodeU (Python codebases), and LongFQA (financial earnings).

Clustering

Clustering Natural Language Processing AI AI

Webinars

Precision in Motion: Why Process Optimization Is the Future of Manufacturing

Airflow Best Practices for ETL/ELT Pipelines

MORE WEBINARS

SIMD-friendly algorithms for substring searching (2016)

Hacker News

JUNE 13, 2025

SIMD-friendly algorithms for substring searching Author: Wojciech MuÅa Added on: 2016-11-28 Updated on: 2018-02-14 (spelling), 2017-04-29 (ARMv8 results) Introduction Popular programming languages provide methods or functions which locate a substring in a given string. All these APIs were designed for one-shot searches.

Algorithm

Algorithm Python

Customize Amazon Nova models to improve tool usage

AWS Machine Learning Blog

APRIL 28, 2025

script with an argparse arg adding two gpus GT tool: terminal LLM output tool: terminal Pred args: ['python run.py gpus 2'] Ground truth pattern: python(3?) Example 2: User question: Who had the most rushing touchdowns for the bengals in 2017 season? gpus 2 Arg matching method: regex match Arg matching score: 1.0

AWS

AWS AI AI Computer Science

I counted all of the yurts in Mongolia using machine learning

Hacker News

JUNE 18, 2025

a day (2017 PPP) (% of population) 11.6% → 0.2% I wrote a Python script that generated tiles from a box around Ulaanbaatar and downloaded them to a folder to use as training data. Indicator Value Years Population 3,481,145 2023 Fertility rate 2.7 The formula for calculating the number of tiles at a given zoom (z) level is: $2^z * 2^z$.

Machine Learning

Machine Learning Machine Learning Python AI

Fine-tune large language models with reinforcement learning from human or AI feedback

Flipboard

APRIL 4, 2025

2017) provided the first evidence that RLHF could be economically scaled up to practical applications. Do not forget to restart your Python kernel after installing the preceding libraries before you import them. 2017) Deep reinforcement learning from human preferences. Christiano et al. Rafailov R. Christiano P.

AI

AI AI Algorithm Artificial Intelligence

Evaluating generative AI models with Amazon Nova LLM-as-a-Judge on Amazon SageMaker AI

AWS Machine Learning Blog

JULY 17, 2025

The provided Python code guides you through the entire workflow. The Claude generation function used the bedrock-runtime AWS SDK for Python (Boto3) client, which accepted a user prompt and returned the model’s text completion: # Initialize Bedrock client once bedrock = boto3.client("bedrock-runtime", and Anthropic’s Claude 3.7.

AI

AI AI AWS Machine Learning

Announcing the First Speakers for ODSC West 2025

ODSC - Open Data Science

JULY 14, 2025

He can teach you about Data Analysis, Java, Python, PostgreSQL, Microservices, Containers, Kubernetes, and some JavaScript. Emmanuel has worked on ML pipelines since 2017 at Instacart and Cruise. Steven Pousty, PhD, Principal and Founder of Tech Raven Consulting Steve is a dad, partner, son, and founder of Tech Raven Consulting.

Machine Learning

Machine Learning Machine Learning ML ML

Announcing the First Speakers for ODSC West 2025

ODSC - Open Data Science

JULY 3, 2025

He can teach you about Data Analysis, Java, Python, PostgreSQL, Microservices, Containers, Kubernetes, and some JavaScript. Emmanuel has worked on ML pipelines since 2017 at Instacart and Cruise. Steven Pousty, PhD, Principal and Founder of Tech Raven Consulting Steve is a dad, partner, son, and founder of Tech Raven Consulting.

Machine Learning

Machine Learning Machine Learning ML ML

Content Moderation: What It Is, How It Works, and the Best APIs

AssemblyAI

DECEMBER 15, 2024

In 2017, several major brands were up in arms when they found their advertising content had been placed next to videos about terrorism on a major video sharing platform. Content Moderation Tutorial Want to learn how to do Content Moderation on audio files in Python?

Azure

Azure AWS AI AI

The Xerox Alto, Smalltalk, and Rewriting a Running GUI

Hacker News

JUNE 9, 2025

Most modern object-oriented languages, from Objective-C and Go to Java and Python, show the influence of Smalltalk. October 22, 2017 at 10:09 AM Unknown said. October 22, 2017 at 4:12 PM Alan Kay said. October 22, 2017 at 11:58 PM Anonymous said. October 23, 2017 at 6:25 AM Tom said.

AWS

AWS Python

A masochist's guide to web development

Hacker News

JUNE 6, 2025

A web server such darkhttpd or the Python http.server package; the examples will use darkhttpd. WebAssembly (or WASM for short) is supported by all major browsers since around 2017. In order to follow them you are going to need: A working installation of Emscripten (which also includes Node.js). > int main() { printf("Hello, web!

Algorithm

Algorithm Python

Evolving Trends in Data Science: Insights from ODSC Conference Sessions from 2015 to 2024

ODSC - Open Data Science

MARCH 10, 2025

Tools like Python , R , and SQL were mainstays, with sessions centered around data wrangling, business intelligence, and the growing role of data scientists in decision-making. By 2017, deep learning began to make waves, driven by breakthroughs in neural networks and the release of frameworks like TensorFlow.

Data Science

Data Science Deep Learning Deep Learning Machine Learning

Modifying an HDMI dummy plug's EDID using a Raspberry Pi

Hacker News

JUNE 15, 2025

How I fixed the infamous Basilisk II Windows “Black Screen” bug in 2013 Apple’s long-lost hidden recovery partition from 1994 has been found The gooey rubber that’s slowly ruining old hard drives The invalid 68030 instruction that accidentally allowed the Mac Classic II to successfully boot up Easy repair of a defective NZXT (..)

Python

New projects contribute to digital commons

Hacker News

JUNE 25, 2025

Circuit Painter is implemented as a simplified Python-based language, using vector graphics-inspired techniques such as matrix transformation to simplify board generation. It enables users to easily automate circuit designs that involve repetitive tasks such as LED matrixes, sensors, and test boards.

EDA

EDA Algorithm Database Data Visualization

Understanding and coding Neural Networks From Scratch in Python and R

Analytics Vidhya

JULY 23, 2020

Note: This article was originally published on May 29, 2017, and updated on July 24, 2020 Overview Neural Networks is one of the most. The post Understanding and coding Neural Networks From Scratch in Python and R appeared first on Analytics Vidhya.

Python

Python Analytics Analytics Algorithm

Exciting Things about Python that Every User Should Know!

Analytics Vidhya

MARCH 9, 2022

Introduction Python is a really interesting programming language and by the end of this blog, you’ll also understand why. The IEEE Spectrum has ranked Python #1 in their list of top programming languages, 2020. It has maintained its position at #1 since the year 2017. It definitely took some time for python […].

Python

Python Analytics Analytics

What is Realtalk’s relationship to AI? (2024)

Hacker News

JULY 10, 2025

From 2017 to Covid, Dynamicland was a community workspace in Oakland, California. A process language fills in gaps in this natural-language structure, and can be whatever is appropriate â currently, Lua, C, C++, Python, JavaScript, Julia, or Haskell. Realtalk-2017 was the first Realtalk system proper.

AI

AI AI Computer Science Computer Science

Finding a 27-year-old easter egg in the Power Mac G3 ROM

Hacker News

JUNE 24, 2025

How I fixed the infamous Basilisk II Windows “Black Screen” bug in 2013 Apple’s long-lost hidden recovery partition from 1994 has been found The gooey rubber that’s slowly ruining old hard drives The invalid 68030 instruction that accidentally allowed the Mac Classic II to successfully boot up Easy repair of a defective NZXT (..)

Python

Introducing Pyrefly: A new type checker and IDE experience for Python

Hacker News

MAY 15, 2025

Today we are announcing an alpha version of Pyrefly , an open source Python type checker and IDE extension crafted in Rust. Pyrefly is a static typechecker that analyzes Python code to ensure type consistency and help you catch errors throughout your codebase before your code runs. Open source Python is open source, and hugely popular.

Python

Understanding Transformers: A Deep Dive into NLP’s Core Technology

Analytics Vidhya

APRIL 16, 2024

Introduction Welcome into the world of Transformers, the deep learning model that has transformed Natural Language Processing (NLP) since its debut in 2017. These linguistic marvels, armed with self-attention mechanisms, revolutionize how machines understand language, from translating texts to analyzing sentiments.

Natural Language Processing

Natural Language Processing Deep Learning Deep Learning Analytics

Build and deploy AI inference workflows with new enhancements to the Amazon SageMaker Python SDK

Flipboard

JUNE 30, 2025

To address this need, we are introducing a new capability in the SageMaker Python SDK that revolutionizes how you build and deploy inference workflows on SageMaker. In this post, we provide an overview of the user experience, detailing how to set up and deploy these workflows with multiple models using the SageMaker Python SDK.

Python

Python AI AI AWS

Altering Python attribute handling for modules

Hacker News

SEPTEMBER 11, 2023

A recent discussion on the Python forum looked at a way to protect module objects (and users) from mistaken attribute assignment and deletion. There are ways to get the same effect today, but the mechanism that would be used causes a performance penalty for an unrelated, and heavily used, action: attribute lookup on modules.

Python

GoLang for Data Science

Data Science 101

APRIL 26, 2019

Gopher Data – Gophers doing data analysis, no schedule events, last blog post was 2017 Gopher Notes – Golang in Jupyter Notebooks Lgo – Interactive programming with Jupyter for Golang Gota – Data frames for Go, “The API is still in flux so use at your own risk.” Golang Data Science Books. Thoughts from the Community.

Data Science

Data Science Machine Learning Machine Learning Python

Explosion in 2017: Our Year in Review

Explosion

JANUARY 12, 2018

spaCy In 2017 spaCy grew into one of the most popular open-source libraries for Artificial Intelligence. In April 2017, we published a follow up that described the solution we were working on, and in August we introduced Prodigy , and started accepting beta users. spaCy’s Machine Learning library for NLP in Python. cython-blis

Machine Learning

Machine Learning Machine Learning Supervised Learning Python

Eight Graphs that Explain Software Engineering Salaries in 2023

Flipboard

MARCH 17, 2023

percent in 2022 compared with 2021, reflecting a steady upward trend since 2017 (with 2020 omitted due to the pandemic disruption). If you’re a software engineer and you don’t know Python, you’d better start studying. Tech Salaries Jump, But Don’t Keep Up With Inflation According to Dice’s numbers , tech salaries grew 2.3

Data Scientist

Data Scientist Artificial Intelligence Artificial Intelligence Machine Learning

Build a crop segmentation machine learning model with Planet data and Amazon SageMaker geospatial capabilities

AWS Machine Learning Blog

SEPTEMBER 29, 2023

Our results reveal that the classification from the KNN model is more accurately representative of the state of the current crop field in 2017 than the ground truth classification data from 2015. However, Landsat 8 lower-resolution imagery could have been used as a bridge between 2015 and 2017.

Machine Learning

Machine Learning Machine Learning ML ML

I Built an OpenAI-Style Swarm That Runs Entirely on My Laptop. Here’s How.

Towards AI

NOVEMBER 18, 2024

A developer’s journey into creating a privacy-focused, cost-effective multi-agent system using Python and open-source LLMs. When I started learning about machine learning and deep learning in my pre-final year of undergrad in 2017–18, I was amazed by the potential of these models. This member-only story is on us.

Deep Learning

Deep Learning Deep Learning Data Scientist Machine Learning

Getting Started with AI

Towards AI

AUGUST 25, 2023

Mirjalili, Python Machine Learning, 2nd ed. Packt, ISBN: 978–1787125933, 2017. McKinney, Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, 2nd ed., O’Reilly Media, ISBN: 978–1491957660, 2017. Natural Language Processing with Python — Analyzing Text with the Natural Language Toolkit.

Machine Learning

Machine Learning Machine Learning AI AI

Introduction

Towards AI

OCTOBER 31, 2023

Therefore, below is the monthly average price of HDB flats from January 2017 to August 2023. Monthly Transactions The image below shows the monthly transactions from January 2017 to August 2023. With that in mind, hopefully this perspective can also add fresh insights and improve the robustness of existing models.

Exploratory Data Analysis

Exploratory Data Analysis Tableau Data Visualization Data Analysis

Top Programming Languages For Data Developers In 2019

Smart Data Collective

JUNE 10, 2019

Python is one of the most important languages for data science. The popularity of python has been on the rise and is showing no signs of waning. Likewise, other popular web development frameworks such as a pyramid, Django and turbo gear are all python-based. This is a minimal programing language similar to python.

Python

Python Data Scientist Machine Learning Machine Learning

Getting Started with OpenAI: The Lingua Franca of AI

Towards AI

APRIL 4, 2024

pip install python-dotenv Then, create a file named.env in the root directory of their project. Yarnit U+007C Generative AI platform for personalized content creation Discover the power of Yarnit.app, the generative AI driven digital content creation platform. To do this, you’ll need to import the libraries.

AI

AI AI Python Artificial Intelligence

70+ Best and Unique Python Machine Learning Projects with source code [2023]

Mlearning.ai

JUNE 6, 2023

In today’s blog, we will see some very interesting Python Machine Learning projects with source code. This is one of the best Machine learning projects in Python. Doctor-Patient Appointment System in Python using Flask Hey guys, in this blog we will see a Doctor-Patient Appointment System for Hospitals built in Python using Flask.

Machine Learning

Machine Learning Machine Learning Python Deep Learning

Simplifying Time Series Analysis for Data Scientists

ODSC - Open Data Science

SEPTEMBER 12, 2023

A variety of time-series functions are included by default, such as cumulative sums, time-weighted averages, and moving averages, and you can also create user-defined functions (UDF) in Python or C.

Data Scientist

Data Scientist Data Lakes Database Data Science

Top Companies to work for if you are a data scientist

Data Science 101

APRIL 12, 2019

LinkedIn’s 2017 report had put Data Scientist as the second fastest growing profession and it’s number one on 2019’s list of most promising jobs. There are three main reasons why data science has been rated as a top job according to research. How can you get a job as a data scientist?

Data Scientist

Data Scientist Data Science DataOps Hadoop

My C++ Now 2023 talk is online: “A TypeScript for C++”

Hacker News

AUGUST 13, 2023

8:00 – summary slide of features demonstrated at CppCon 2022 – safety for C++; goal of 50x fewer CVEs due to type/bounds/lifetime/init safety – simplicity for C++; goal of 10x less to know 10:00 – 2.

Python

Comprehensive Guide: Top Computer Vision Resources All in One Blog

Mlearning.ai

JANUARY 27, 2023

How to read an image in Python using OpenCV — 2023 2. Rotating and Scaling Images using cv2 — a fun Python application — 2023 5. How to use mouse clicks to draw circles in Python using OpenCV — easy project — 2023 6. How to use mouse clicks to draw circles in Python using OpenCV — easy project — 2023 6.

Deep Learning

Deep Learning Deep Learning Python Data Scientist

Announcing new Jupyter contributions by AWS to democratize generative AI and scale ML workloads

AWS Machine Learning Blog

MAY 10, 2023

Today, we are excited to announce that JupyterLab users can install and use the CodeWhisperer extension for free to generate real-time, single-line, or full-function code suggestions for Python notebooks in JupyterLab and Amazon SageMaker Studio. In 2016, he co-created the Altair package for statistical visualization in Python.

ML

ML ML AWS Data Science

Top 10 Generative AI Companies Revealed

Towards AI

APRIL 19, 2024

Amazon (AWS) 👉Industry domain: Online retail and web services provider 👉Location: Over 175 Amazon fulfillment centers globally 👉Year founded: 1994 👉Key Products developed: Amazon Bedrock, Q, Code Whisperer, Sage Maker 👉Benefits: Fully managed generative AI service options, AWS free tier for experimentation 7.

AI

AI AI Artificial Intelligence Artificial Intelligence

What’s New in PyTorch 2.0? torch.compile

Flipboard

MARCH 27, 2023

The success of PyTorch is attributed to its simplicity, first-class Python integration, and imperative style of programming. Since the launch of PyTorch in 2017, it has strived for high performance and eager execution. is available as a Python pip package. torch.compile We start this lesson by learning to install PyTorch 2.0.

Deep Learning

Deep Learning Deep Learning Python Natural Language Processing

Introduction to Pandas for Machine Learning

How to Learn Machine Learning

DECEMBER 11, 2022

In this article we will provide a brief introduction to Pandas, one of the most famous Python libraries for Data Science and Machine learning. Introduction to Pandas – The fundamentals Pandas is a popular and powerful open-source data analysis and manipulation library for the Python programming language. Lets get to it!

Machine Learning

Machine Learning Machine Learning Data Analysis Data Analysis

Taking Pandas To The Next Level With LLMs

Mlearning.ai

MAY 15, 2023

describe() count 9994 mean 2017-04-30 05:17:08.056834048 min 2015-01-03 00:00:00 25% 2016-05-23 00:00:00 50% 2017-06-26 00:00:00 75% 2018-05-14 00:00:00 max 2018-12-30 00:00:00 Name: Order Date, dtype: object Average sales per year df['year'] = df['Order Date'].apply(lambda Yearly average sales. Convert it into a graph.

Data Science

Data Science Machine Learning Machine Learning AI

Arranging Invisible Icons in Quadratic Time

Hacker News

FEBRUARY 16, 2021

When my Python script started creating images the explorer.exe process would notice and immediately start trying to lay out icons. Instead you can read about how I used 19 different commute methods in September 2018 , or 20 different commute methods in April 2017. Tired of reading boring performance analysis?

Algorithm

Algorithm Python

Data lakehouse

Llama 4 family of models from Meta are now available in SageMaker JumpStart

Webinars

Trending Sources

Evaluating Long-Context Question & Answer Systems

Webinars

SIMD-friendly algorithms for substring searching (2016)

Customize Amazon Nova models to improve tool usage

I counted all of the yurts in Mongolia using machine learning

Fine-tune large language models with reinforcement learning from human or AI feedback

Evaluating generative AI models with Amazon Nova LLM-as-a-Judge on Amazon SageMaker AI

Announcing the First Speakers for ODSC West 2025

Announcing the First Speakers for ODSC West 2025

Content Moderation: What It Is, How It Works, and the Best APIs

The Xerox Alto, Smalltalk, and Rewriting a Running GUI

A masochist's guide to web development

Evolving Trends in Data Science: Insights from ODSC Conference Sessions from 2015 to 2024

Modifying an HDMI dummy plug's EDID using a Raspberry Pi

New projects contribute to digital commons

Understanding and coding Neural Networks From Scratch in Python and R

Exciting Things about Python that Every User Should Know!

What is Realtalk’s relationship to AI? (2024)

Finding a 27-year-old easter egg in the Power Mac G3 ROM

Introducing Pyrefly: A new type checker and IDE experience for Python

Understanding Transformers: A Deep Dive into NLP’s Core Technology

Build and deploy AI inference workflows with new enhancements to the Amazon SageMaker Python SDK

Altering Python attribute handling for modules

GoLang for Data Science

Explosion in 2017: Our Year in Review

Eight Graphs that Explain Software Engineering Salaries in 2023

Build a crop segmentation machine learning model with Planet data and Amazon SageMaker geospatial capabilities

I Built an OpenAI-Style Swarm That Runs Entirely on My Laptop. Here’s How.

Getting Started with AI

Introduction

Top Programming Languages For Data Developers In 2019

Getting Started with OpenAI: The Lingua Franca of AI

70+ Best and Unique Python Machine Learning Projects with source code [2023]

Simplifying Time Series Analysis for Data Scientists

Top Companies to work for if you are a data scientist

My C++ Now 2023 talk is online: “A TypeScript for C++”

Comprehensive Guide: Top Computer Vision Resources All in One Blog

Announcing new Jupyter contributions by AWS to democratize generative AI and scale ML workloads

Top 10 Generative AI Companies Revealed

What’s New in PyTorch 2.0? torch.compile

Introduction to Pandas for Machine Learning

Taking Pandas To The Next Level With LLMs

Arranging Invisible Icons in Quadratic Time

Stay Connected