2015, Clustering and Data Science - Data Science Current

Top 6 Kubernetes use cases

IBM Journey to AI blog

NOVEMBER 13, 2023

But Docker lacked an automated “orchestration” tool, which made it time-consuming and complex for data science teams to scale applications. Nodes run the pods and are usually grouped in a Kubernetes cluster, abstracting the underlying physical hardware resources.

Machine Learning, ML

How Meesho built a generalized feed ranker using Amazon SageMaker inference

AWS Machine Learning Blog

OCTOBER 20, 2023

Meesho was founded in 2015 and today focuses on buyers and sellers across India. Model training: Meesho used Amazon EMR with Apache Spark to process hundreds of millions of data points, depending on the model’s complexity. One of the major challenges was to run distributed training at scale.

AWS, Data Scientist, ML

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

OCTOBER 5, 2023

Our high-level training procedure is as follows: for our training environment, we use a multi-instance cluster managed by the SLURM system for distributed training and scheduling under the NeMo framework. Dr. Huan works on AI and Data Science. He focuses on developing scalable machine learning algorithms. Youngsuk Park is a Sr.

AWS, Machine Learning, Deep Learning

Exploring Google’s AI Tools: A Deep Dive into the Future of Data Science

ODSC - Open Data Science

OCTOBER 15, 2024

During a recent episode of ODSC’s Ai X Podcast with Paige Bailey, Engineering Lead for Gen AI Development Experience at Google, we delved into the groundbreaking AI tools and platforms that are shaping the future of data science. Check out her talk, “Data Science in the Age of Generative AI,” there!

Data Science, Data Scientist, AI

Demand forecasting at Getir built with Amazon Forecast

AWS Machine Learning Blog

MAY 15, 2023

Getir was founded in 2015 and operates in Turkey, the UK, the Netherlands, Germany, France, Spain, Italy, Portugal, and the United States. Solution overview: Six people from Getir’s data science team and infrastructure team worked together on this project. Getir is the pioneer of ultrafast grocery delivery.

Algorithm, Data Scientist, Machine Learning

23 Best Free NLP Datasets for Machine Learning

Iguazio

SEPTEMBER 20, 2023

The data file format comprises the Tweet’s polarity, ID, date, query, user and text. Twitter US Airline Sentiment: polarized Tweets from February 2015 about the large US airlines. Data is provided in a CSV file and SQLite database. Get the dataset here.

Machine Learning, Database, Data Scientist

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning Blog

NOVEMBER 19, 2024

Explore the model pre-training workflow from start to finish, including setting up clusters, troubleshooting convergence issues, and running distributed training to improve model performance. Gain hands-on experience in data management, model training, monitoring, and seamless deployment to production environments.

AWS, ML, AI

Elon Musk’s xAI Unveils Grok 3 AI Model, Claims Edge Over OpenAI and DeepSeek

ODSC - Open Data Science

FEBRUARY 20, 2025

OpenAI, a company Musk co-founded in 2015, introduced its GPT-4-based model, the o1, last year, which showcased strong problem-solving abilities in coding, math, and science. The company disclosed Tuesday that it had doubled its GPU cluster to 200,000 Nvidia units for Grok 3’s training, up from 100,000 in 2023. What’s Next?

AI, Artificial Intelligence

What is Snowpark — and Why Does it Matter? A phData Perspective

phData

SEPTEMBER 20, 2023

We think those workloads fall into three broad categories: Data Science and Machine Learning – Data Scientists love Python, which makes Snowpark Python an ideal framework for machine learning development and deployment. phData has been working in data engineering since the inception of the company back in 2015.

SQL, Python, Data Lakes, Machine Learning

How SnapLogic built a text-to-pipeline application with Amazon Bedrock to translate business intent into action

Flipboard

NOVEMBER 24, 2023

Since joining SnapLogic in 2010, Greg has helped design and implement several key platform features including cluster processing, big data processing, the cloud architecture, and machine learning. He currently is working on Generative AI for data integration.

Database, AWS, ETL, SQL

Financial text generation using a domain-adapted fine-tuned large language model in Amazon SageMaker JumpStart

AWS Machine Learning Blog

APRIL 18, 2023

per diluted share, for the year ended December 31, 2015. per diluted share, compared to $3,818,000, or $0.21

ML, Deep Learning

Building a Predictive Model in KNIME

phData

MARCH 6, 2023

If you spend even a few minutes on KNIME’s website or browsing through their whitepapers and blog posts, you’ll notice a common theme: a strong emphasis on data science and predictive modeling. Predicting Crimes in Phoenix, Arizona We have a dataset containing nearly 400,000 crimes committed in Phoenix, Arizona between 2015 and 2021.

Decision Trees, Analytics, Data Science

Federated Learning on AWS with FedML: Health analytics without sharing sensitive data – Part 2

AWS Machine Learning Blog

JANUARY 13, 2023

They were admitted to one of 335 units at 208 hospitals located throughout the US between 2014–2015. Due to the underlying heterogeneity and distributed nature of the data, it provides an ideal real-world example to test this FL framework. Please follow the steps listed here to install wandb and setup monitoring for this solution.

AWS, Analytics, Machine Learning

Domain-adaptation Fine-tuning of Foundation Models in Amazon SageMaker JumpStart on Financial data

AWS Machine Learning Blog

APRIL 18, 2023

per diluted share, for the year ended December 31, 2015. per diluted share, compared to $3,818,000, or $0.21

ML, Deep Learning

Comparative Analysis: PyTorch vs TensorFlow vs Keras

Pickl AI

AUGUST 22, 2024

Overview of TensorFlow: TensorFlow, developed by Google Brain, is a robust and versatile deep learning framework that was introduced in 2015. Scalability: TensorFlow can handle large datasets and scale to distributed clusters, making it suitable for training complex models.

Deep Learning, Machine Learning

The Story Continues: Announcing Version 14 of Wolfram Language and Mathematica

Hacker News

JANUARY 9, 2024

One very simple example (introduced in 2015) is Nothing: Another, introduced in 2020, is Splice: An old chestnut of Wolfram Language design concerns the way infinite evaluation loops are handled. … but with things like clustering). And in Version 13.2 … We’ve had “basic, raw NDSolve” since 1991.

Python, Algorithm, Machine Learning

Meet the Winners of the Youth Mental Health Narratives Challenge

DrivenData Labs

FEBRUARY 3, 2025

Most solvers were data science professionals, professors, and students, but there were also many data analysts, project managers, and people working in public health and healthcare. His journey in AI began in 2015 with a master's in computer vision for biomedical image analysis. Alejandro A.

Machine Learning, Data Science, Natural Language Processing
