AI, Clustering and ML - Data Science Current

Adding Explainability to Clustering

Analytics Vidhya

MAY 26, 2022

Explainable AI is no longer just an optional add-on when using ML algorithms for corporate decision making. The post Adding Explainability to Clustering appeared first on Analytics Vidhya. Introduction The ability to explain decisions is increasingly becoming important across businesses.

Clustering

Clustering Algorithm Data Science ML

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning Blog

NOVEMBER 19, 2024

This year, generative AI and machine learning (ML) will again be in focus, with exciting keynote announcements and a variety of sessions showcasing insights from AWS experts, customer stories, and hands-on experiences with AWS services. Fifth, we’ll showcase various generative AI use cases across industries.

AWS

AWS ML ML AI

Speed up your cluster procurement time with Amazon SageMaker HyperPod training plans

AWS Machine Learning Blog

DECEMBER 5, 2024

In this post, we demonstrate how you can address this requirement by using Amazon SageMaker HyperPod training plans , which can bring down your training cluster procurement wait time. We further guide you through using the training plan to submit SageMaker training jobs or create SageMaker HyperPod clusters. Create a new training plan.

Clustering

Clustering AWS Python ML

Webinars

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

MORE WEBINARS

Racing into the future: How AWS DeepRacer fueled my AI and ML journey

AWS Machine Learning Blog

NOVEMBER 19, 2024

At the time, I knew little about AI or machine learning (ML). But AWS DeepRacer instantly captured my interest with its promise that even inexperienced developers could get involved in AI and ML. Panic set in as we realized we would be competing on stage in front of thousands of people while knowing little about ML.

AWS

AWS ML ML AI

Ray: Your Gateway to Scalable AI and Machine Learning Applications

Analytics Vidhya

MARCH 25, 2025

Ray has emerged as a powerful framework for distributed computing in AI and ML workloads, enabling researchers and practitioners to scale their applications from laptops to clusters with minimal code changes.

Machine Learning

Machine Learning Machine Learning Clustering ML

Identification of Hazardous Areas for Priority Landmine Clearance: AI for Humanitarian Mine Action

ML @ CMU

NOVEMBER 7, 2024

In close collaboration with the UN and local NGOs, we co-develop an interpretable predictive tool for landmine contamination to identify hazardous clusters under geographic and budget constraints, experimentally reducing false alarms and clearance time by half. RELand consistently outperforms the benchmark models on all relevant metrics.

Clustering

Clustering Cross Validation Machine Learning Machine Learning

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

AWS Machine Learning Blog

OCTOBER 16, 2024

Amazon SageMaker supports geospatial machine learning (ML) capabilities, allowing data scientists and ML engineers to build, train, and deploy ML models using geospatial data. We use the purpose-built geospatial container with SageMaker Processing jobs for a simplified, managed experience to create and run a cluster.

ML

ML ML Clustering Machine Learning

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning Blog

APRIL 10, 2025

As cluster sizes grow, the likelihood of failure increases due to the number of hardware components involved. Larger clusters, more failures, smaller MTBF As cluster size increases, the entropy of the system increases, resulting in a lower MTBF. It implies that if a single instance fails, it stops the entire job.

ML

ML ML Clustering AWS

Enhance your Amazon Redshift cloud data warehouse with easier, simpler, and faster machine learning using Amazon SageMaker Canvas

AWS Machine Learning Blog

OCTOBER 24, 2024

Machine learning (ML) helps organizations to increase revenue, drive business growth, and reduce costs by optimizing core business functions such as supply and demand forecasting, customer churn prediction, credit risk scoring, pricing, predicting late shipments, and many others. For this post we’ll use a provisioned Amazon Redshift cluster.

Data Warehouse

Data Warehouse Machine Learning Machine Learning Cloud Data

Real value, real time: Production AI with Amazon SageMaker and Tecton

AWS Machine Learning Blog

DECEMBER 4, 2024

Businesses are under pressure to show return on investment (ROI) from AI use cases, whether predictive machine learning (ML) or generative AI. Only 54% of ML prototypes make it to production, and only 5% of generative AI use cases make it to production. Using SageMaker, you can build, train and deploy ML models.

ML

ML ML AWS AI

How climate tech startups are building foundation models with Amazon SageMaker HyperPod

Flipboard

JUNE 4, 2025

Climate tech startups are at the forefront of building impactful solutions to the climate crisis, and theyre using generative AI to build as quickly as possible. Trends among climate tech startups building with generative AI Climate tech startups adoption of generative AI is evolving rapidly.

AWS

AWS Clustering ML ML

Hammerspace Unveils the Fastest File System in the World for Training Enterprise AI Models at Scale

insideBIGDATA

MARCH 4, 2024

Hammerspace, the company orchestrating the Next Data Cycle, unveiled the high-performance NAS architecture needed to address the requirements of broad-based enterprise AI, machine learning and deep learning (AI/ML/DL) initiatives and the widespread rise of GPU computing both on-premises and in the cloud.

Deep Learning

Deep Learning Deep Learning Clustering Machine Learning

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

AWS Machine Learning Blog

NOVEMBER 26, 2024

Solution overview The steps to implement the solution are as follows: Create the EKS cluster. Create the EKS cluster If you don’t have an existing EKS cluster, you can create one using eksctl. Adjust the following configuration to suit your needs, such as the Amazon EKS version, cluster name, and AWS Region.

AWS

AWS Clustering ML ML

How Lumi streamlines loan approvals with Amazon SageMaker AI

AWS Machine Learning Blog

APRIL 4, 2025

They use real-time data and machine learning (ML) to offer customized loans that fuel sustainable growth and solve the challenges of accessing capital. These classified transactions then serve as critical inputs for downstream credit risk AI models, enabling more accurate assessments of a businesss creditworthiness.

AI

AI AI Machine Learning Machine Learning

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning Blog

DECEMBER 24, 2024

To reduce costs while continuing to use the power of AI , many companies have shifted to fine tuning LLMs on their domain-specific data using Parameter-Efficient Fine Tuning (PEFT). Manually managing such complexity can often be counter-productive and take away valuable resources from your businesses AI development.

AWS

AWS Clustering Deep Learning Deep Learning

Unleash AI innovation with Amazon SageMaker HyperPod

AWS Machine Learning Blog

MARCH 18, 2025

The rise of generative AI has significantly increased the complexity of building, training, and deploying machine learning (ML) models. It now demands deep expertise, access to vast datasets, and the management of extensive compute clusters.

AI

AI AI AWS Clustering

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning Blog

MARCH 3, 2025

Increasingly, organizations across industries are turning to generative AI foundation models (FMs) to enhance their applications. The launcher interfaces with underlying cluster management systems such as SageMaker HyperPod (Slurm or Kubernetes) or training jobs, which handle resource allocation and scheduling. recipes=recipe-name.

Clustering

Clustering AWS ML ML

Train, optimize, and deploy models on edge devices using Amazon SageMaker and Qualcomm AI Hub

AWS Machine Learning Blog

OCTOBER 18, 2024

In this post, we introduce an innovative solution for end-to-end model customization and deployment at the edge using Amazon SageMaker and Qualcomm AI Hub. After fine-tuning, we show you how to optimize the model with Qualcomm AI Hub so that it’s ready for deployment across edge devices powered by Snapdragon and Qualcomm platforms.

AWS

AWS AI AI Machine Learning

Build agentic AI solutions with DeepSeek-R1, CrewAI, and Amazon SageMaker AI

Flipboard

FEBRUARY 10, 2025

AI agents are rapidly becoming the next frontier in enterprise transformation, with 82% of organizations planning adoption within the next 3 years. According to a Capgemini survey of 1,100 executives at large enterprises, 10% of organizations already use AI agents, and more than half plan to use them in the next year.

AI

AI AI AWS ML

How To Enhance Your Analytics with Insightful ML Approaches

Smart Data Collective

AUGUST 29, 2022

This is why businesses are looking to leverage machine learning (ML). In this article, we will share some best practices for improving your analytics with ML. Top ML approaches to improve your analytics. Clustering. ?lustering They need a more comprehensive analytics strategy to achieve these business goals.

ML

ML ML Analytics Analytics

This AI can predict genetic mutations before they happen

Dataconomy

MARCH 3, 2025

Thanks to machine learning (ML) and artificial intelligence (AI), it is possible to predict cellular responses and extract meaningful insights without the need for exhaustive laboratory experiments. They introduce PERTURBQA , a benchmark designed to align AI-driven perturbation models with real biological decision-making.

AI

AI AI Clustering Machine Learning

10 Technical Blogs for Data Scientists to Advance AI/ML Skills

DataRobot Blog

DECEMBER 6, 2022

Other organizations are just discovering how to apply AI to accelerate experimentation time frames and find the best models to produce results. With a goal to help data science teams learn about the application of AI and ML, DataRobot shares helpful, educational blogs based on work with the world’s most strategic companies.

Data Scientist

Data Scientist ML ML AI

Ray jobs on Amazon SageMaker HyperPod: scalable and resilient distributed AI

AWS Machine Learning Blog

APRIL 2, 2025

At its core, Ray offers a unified programming model that allows developers to seamlessly scale their applications from a single machine to a distributed cluster. Ray promotes the same coding patterns for both a simple machine learning (ML) experiment and a scalable, resilient production application.

Clustering

Clustering AWS AI AI

Syngenta develops a generative AI assistant to support sales representatives using Amazon Bedrock Agents

Flipboard

DECEMBER 3, 2024

Syngenta and AWS collaborated to develop Cropwise AI , an innovative solution powered by Amazon Bedrock Agents , to accelerate their sales reps’ ability to place Syngenta seed products with growers across North America. Generative AI is reshaping businesses and unlocking new opportunities across various industries.

AWS

AWS AI AI Machine Learning

KNNs & K-Means: The Superior Alternative to Clustering & Classification.

Towards AI

SEPTEMBER 3, 2024

Last Updated on September 3, 2024 by Editorial Team Author(s): Surya Maddula Originally published on Towards AI. Let’s discuss two popular ML algorithms, KNNs and K-Means. We will discuss KNNs, also known as K-Nearest Neighbours and K-Means Clustering. They are both ML Algorithms, and we’ll explore them more in detail in a bit.

K-nearest Neighbors

K-nearest Neighbors Clustering ML ML

Accelerate pre-training of Mistral’s Mathstral model with highly resilient clusters on Amazon SageMaker HyperPod

AWS Machine Learning Blog

SEPTEMBER 18, 2024

The compute clusters used in these scenarios are composed of more than thousands of AI accelerators such as GPUs or AWS Trainium and AWS Inferentia , custom machine learning (ML) chips designed by Amazon Web Services (AWS) to accelerate deep learning workloads in the cloud.

Clustering

Clustering AWS ML ML

How Aetion is using generative AI and Amazon Bedrock to unlock hidden insights about patient populations

AWS Machine Learning Blog

JANUARY 30, 2025

The use of unsupervised learning methods on semi-structured data along with generative AI has been transformative in unlocking hidden insights. Amazon Bedrock is a fully managed service that provides access to high-performing foundation models (FMs) from leading AI startups and Amazon through a unified API.

Clustering

Clustering Natural Language Processing AI AI

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

AWS Machine Learning Blog

NOVEMBER 14, 2024

We recently announced the general availability of cross-account sharing of Amazon SageMaker Model Registry using AWS Resource Access Manager (AWS RAM) , making it easier to securely share and discover machine learning (ML) models across your AWS accounts. Human oversight : Including human involvement in AI decision-making processes.

AWS

AWS ML ML Machine Learning

How Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model

AWS Machine Learning Blog

DECEMBER 18, 2024

AIs transformative impact extends throughout the modern business landscape, with telecommunications emerging as a key area of innovation. Fastweb , one of Italys leading telecommunications operators, recognized the immense potential of AI technologies early on and began investing in this area in 2019.

Clustering

Clustering AWS AI AI

Node problem detection and recovery for AWS Neuron nodes within Amazon EKS clusters

AWS Machine Learning Blog

JULY 25, 2024

By accelerating the speed of issue detection and remediation, it increases the reliability of your ML training and reduces the wasted time and cost due to hardware failure. Choose Clusters in the navigation pane, open the trainium-inferentia cluster, choose Node groups, and locate your node group. # install.sh

Clustering

Clustering AWS ML ML

Build scalable containerized RAG based generative AI applications in AWS using Amazon EKS with Amazon Bedrock

Flipboard

MAY 13, 2025

Generative artificial intelligence (AI) applications are commonly built using a technique called Retrieval Augmented Generation (RAG) that provides foundation models (FMs) access to additional data they didnt have during training.

AWS

AWS AI AI Clustering

Meta’s open AI hardware vision

Hacker News

OCTOBER 15, 2024

At the Open Compute Project (OCP) Global Summit 2024, we’re showcasing our latest open AI hardware designs with the OCP community. These innovations include a new AI platform, cutting-edge open rack designs, and advanced network fabrics and components. Prior to Llama, our largest AI jobs ran on 128 NVIDIA A100 GPUs.

Clustering

Clustering AI AI Deep Learning

Integrate HyperPod clusters with Active Directory for seamless multi-user login

AWS Machine Learning Blog

APRIL 22, 2024

Amazon SageMaker HyperPod is purpose-built to accelerate foundation model (FM) training, removing the undifferentiated heavy lifting involved in managing and optimizing a large training compute cluster. In this solution, HyperPod cluster instances use the LDAPS protocol to connect to the AWS Managed Microsoft AD via an NLB.

Clustering

Clustering AWS Machine Learning Machine Learning

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

AWS Machine Learning Blog

NOVEMBER 22, 2024

Companies across various scales and industries are using large language models (LLMs) to develop generative AI applications that provide innovative experiences for customers and employees. By offloading the management and maintenance of the training cluster to SageMaker, we reduce both training time and our total cost of ownership (TCO).

Clustering

Clustering AWS ML ML

OpenSearch Vector Engine is now disk-optimized for low cost, accurate vector search

Flipboard

JANUARY 24, 2025

Overview of vector search and the OpenSearch Vector Engine Vector search is a technique that improves search quality by enabling similarity matching on content that has been encoded by machine learning (ML) models into vectors (numerical encodings). These benchmarks arent designed for evaluating ML models.

K-nearest Neighbors

K-nearest Neighbors ML ML Algorithm

Maintaining large-scale AI capacity at Meta

Hacker News

JUNE 12, 2024

Meta is currently operating many data centers with GPU training clusters across the world. A year ago, however, as the industry reached a critical inflection point due to the rise of artificial intelligence (AI), we recognized that to lead in the generative AI space we’d need to transform our fleet.

Clustering

Clustering AI AI Artificial Intelligence

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning Blog

NOVEMBER 27, 2024

You can use these techniques together to train complex models that are orders of magnitude faster and rapidly iterate and deploy innovative AI solutions that drive business value. After they’re initiated, SageMaker training jobs spin up the cluster, provisioning the specified number and type of compute instances.

AWS

AWS Clustering ML ML

Elevating ML to new heights with distributed learning

Dataconomy

MAY 22, 2023

TensorFlow provides high-level APIs, such as tf.distribute, to distribute training across multiple devices, machines, or clusters. PyTorch: PyTorch , developed by Facebook’s AI Research lab, is another popular distributed learning framework.

ML

ML ML Machine Learning Machine Learning

Scale and simplify ML workload monitoring on Amazon EKS with AWS Neuron Monitor container

AWS Machine Learning Blog

JUNE 25, 2024

This solution simplifies the integration of advanced monitoring tools such as Prometheus and Grafana, enabling you to set up and manage your machine learning (ML) workflows with AWS AI Chips. By deploying the Neuron Monitor DaemonSet across EKS nodes, developers can collect and analyze performance metrics from ML workload pods.

AWS

AWS ML ML Clustering

Optimizing costs of generative AI applications on AWS

AWS Machine Learning Blog

DECEMBER 26, 2024

The report The economic potential of generative AI: The next productivity frontier , published by McKinsey & Company, estimates that generative AI could add an equivalent of $2.6 The potential for such large business value is galvanizing tens of thousands of enterprises to build their generative AI applications in AWS.

AWS

AWS Database AI AI

From innovation to impact: How AWS and NVIDIA enable real-world generative AI success

AWS Machine Learning Blog

MARCH 19, 2025

As we gather for NVIDIA GTC, organizations of all sizes are at a pivotal moment in their AI journey. The question is no longer whether to adopt generative AI, but how to move from promising pilots to production-ready systems that deliver real business value.

AWS

AWS AI AI Clustering

Keeping up with ML Research: A Tool to Navigate the ML Innovation Maze

Towards AI

FEBRUARY 21, 2024

Author(s): Alessandro Amenta Originally published on Towards AI. Image generated with DALL-E 3 In the fast-paced world of Machine Learning (ML) research, keeping up with the latest findings is crucial and exciting, but let’s be honest — it’s also a challenge. What’s the next big thing in ML?

ML

ML ML Machine Learning Machine Learning

How Booking.com modernized its ML experimentation framework with Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 12, 2024

Sharing in-house resources with other internal teams, the Ranking team machine learning (ML) scientists often encountered long wait times to access resources for model training and experimentation – challenging their ability to rapidly experiment and innovate. If it shows online improvement, it can be deployed to all the users.

ML

ML ML AWS Machine Learning

Building Meta’s GenAI Infrastructure

Hacker News

MARCH 12, 2024

Marking a major investment in Meta’s AI future, we are announcing two 24k GPU clusters. We are sharing details on the hardware, network, storage, design, performance, and software that help us extract high throughput and reliability for various AI workloads. We use this cluster design for Llama 3 training.

Clustering

Clustering AI AI ML

Adding Explainability to Clustering

Your guide to generative AI and ML at AWS re:Invent 2024

Webinars

Trending Sources

Speed up your cluster procurement time with Amazon SageMaker HyperPod training plans

Webinars

Racing into the future: How AWS DeepRacer fueled my AI and ML journey

Ray: Your Gateway to Scalable AI and Machine Learning Applications

Identification of Hazardous Areas for Priority Landmine Clearance: AI for Humanitarian Mine Action

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

Reduce ML training costs with Amazon SageMaker HyperPod

Enhance your Amazon Redshift cloud data warehouse with easier, simpler, and faster machine learning using Amazon SageMaker Canvas

Real value, real time: Production AI with Amazon SageMaker and Tecton

How climate tech startups are building foundation models with Amazon SageMaker HyperPod

Hammerspace Unveils the Fastest File System in the World for Training Enterprise AI Models at Scale

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM

How Lumi streamlines loan approvals with Amazon SageMaker AI

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Unleash AI innovation with Amazon SageMaker HyperPod

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Train, optimize, and deploy models on edge devices using Amazon SageMaker and Qualcomm AI Hub

Build agentic AI solutions with DeepSeek-R1, CrewAI, and Amazon SageMaker AI

How To Enhance Your Analytics with Insightful ML Approaches

This AI can predict genetic mutations before they happen

10 Technical Blogs for Data Scientists to Advance AI/ML Skills

Ray jobs on Amazon SageMaker HyperPod: scalable and resilient distributed AI

Syngenta develops a generative AI assistant to support sales representatives using Amazon Bedrock Agents

KNNs & K-Means: The Superior Alternative to Clustering & Classification.

Accelerate pre-training of Mistral’s Mathstral model with highly resilient clusters on Amazon SageMaker HyperPod

How Aetion is using generative AI and Amazon Bedrock to unlock hidden insights about patient populations

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

How Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model

Node problem detection and recovery for AWS Neuron nodes within Amazon EKS clusters

Build scalable containerized RAG based generative AI applications in AWS using Amazon EKS with Amazon Bedrock

Meta’s open AI hardware vision

Integrate HyperPod clusters with Active Directory for seamless multi-user login

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

OpenSearch Vector Engine is now disk-optimized for low cost, accurate vector search

Maintaining large-scale AI capacity at Meta

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Elevating ML to new heights with distributed learning

Scale and simplify ML workload monitoring on Amazon EKS with AWS Neuron Monitor container

Optimizing costs of generative AI applications on AWS

From innovation to impact: How AWS and NVIDIA enable real-world generative AI success

Keeping up with ML Research: A Tool to Navigate the ML Innovation Maze

How Booking.com modernized its ML experimentation framework with Amazon SageMaker

Building Meta’s GenAI Infrastructure

Stay Connected