2018 and AWS - Data Science Current

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

OCTOBER 5, 2023

In this post, we walk through how to fine-tune Llama 2 on AWS Trainium, a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.

AWS

AWS Machine Learning Deep Learning

Build a medical imaging AI inference pipeline with MONAI Deploy on AWS

AWS Machine Learning Blog

NOVEMBER 8, 2023

AWS and NVIDIA have come together to make this vision a reality. AWS, NVIDIA, and other partners build applications and solutions to make healthcare more accessible, affordable, and efficient by accelerating cloud connectivity of enterprise imaging. AHI provides API access to ImageSet metadata and ImageFrames.

AWS

AWS AI ML

Generative AI and multi-modal agents in AWS: The key to unlocking new value in financial markets

AWS Machine Learning Blog

SEPTEMBER 19, 2023

Implementing a multi-modal agent with AWS consolidates key insights from diverse structured and unstructured data on a large scale. All this is achieved using AWS services, thereby increasing the financial analyst’s efficiency to analyze multi-modal financial data (text, speech, and tabular data) holistically.

AWS

AWS AI ML

Bundesliga Match Fact Keeper Efficiency: Comparing keepers’ performances objectively using machine learning on AWS

AWS Machine Learning Blog

MARCH 30, 2023

Not only was he widely considered the top-rated goalkeeper in the league during the 2021/22 season, but he also held that title back in 2018/19 when Eintracht Frankfurt reached the Europa League semifinals. The BMF logic itself (except for the ML model) runs on an AWS Fargate container.

Machine Learning

Machine Learning AWS Apache Kafka

Deploy large language models for a healthtech use case on Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 6, 2024

We implemented the solution using the AWS Cloud Development Kit (AWS CDK). One of the more popular and useful of the transformer architectures, Bidirectional Encoder Representations from Transformers (BERT), is a language representation model that was introduced in 2018. The first GPT model was introduced in 2018 by OpenAI.

AWS

AWS ML Data Preparation

Industry Pulse April 2018 Highlights

DataRobot Blog

MAY 6, 2018

This past month we had news from SAS Global Forum, MicroStrategy, Oracle, AWS, Google, Qlik Qonnections, Tableau, and several other smaller vendors. By Jen Underwood. Fallout from the March Facebook scandal continued while GDPR.

Tableau

Tableau AWS

Incorporate offline and online human – machine workflows into your generative AI applications on AWS

AWS Machine Learning Blog

MAY 14, 2024

We present the solution and provide an example by simulating a case where the tier one AWS experts are notified to help customers using a chat-bot. We provide LangChain and AWS SDK code-snippets, architecture and discussions to guide you on this important topic. Here, we use the on-demand option.

AWS

AWS AI Machine Learning

A Glimpse into the Unprecedented Growth of NVIDIA in the World of AI

Data Science Dojo

MARCH 4, 2024

RTX Series and Ray Tracing: In 2018, NVIDIA enhanced the capabilities of its GPUs with real-time ray tracing, known as the RTX Series. Collaborations with leading tech giants – AWS, Microsoft, and Google among others – paved the way to expand NVIDIA’s influence in the AI market.

Deep Learning

Deep Learning AI

Predict football punt and kickoff return yards with fat-tailed distribution using GluonTS

Flipboard

FEBRUARY 2, 2023

There are around 3,000 and 4,000 plays from four NFL seasons (2018–2021) for punt and kickoff plays, respectively. Models were trained and cross-validated on the 2018, 2019, and 2020 seasons and tested on the 2021 season. He works with AWS customers to solve business problems with artificial intelligence and machine learning.

Cross Validation

Cross Validation ML Machine Learning

10 edge computing innovators to keep an eye on in 2023

Dataconomy

APRIL 26, 2023

The growing demand for edge computing services is driving innovation and competition among edge computing companies. Aarna Networks: Aarna Networks, established in 2018, is striving to simplify edge orchestration for enterprises by offering private 5G and enterprise edge computing application automation software.

Internet of Things

Internet of Things Azure AWS Cloud Computing

Recommend top trending items to your users using the new Amazon Personalize recipe

AWS Machine Learning Blog

MARCH 30, 2023

Choose the new aws-trending-now recipe. For Solution version ID, choose the solution version that uses the aws-trending-now recipe. You can delete filters, recommenders, datasets, and dataset groups via the AWS Management Console or using the Python SDK. Applied AI Specialist Architect at AWS.

AWS

AWS ML Machine Learning

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

AWS Machine Learning Blog

APRIL 19, 2023

Since 2018, our team has been developing a variety of ML models to enable betting products for NFL and NCAA football. It also includes support for new hardware like ARM (both in servers like AWS Graviton and laptops with Apple M1) and AWS Inferentia. Business requirements: We are the US squad of the Sportradar AI department.

ML

ML Deep Learning

How Marubeni is optimizing market decisions using AWS machine learning and analytics

AWS Machine Learning Blog

MARCH 8, 2023

In this post, you will learn how Marubeni is optimizing market decisions by using the broad set of AWS analytics and ML services, to build a robust and cost-effective Power Bid Optimization solution. AWS Step Functions to orchestrate both the data and ML pipelines. Manager Data Science at Marubeni Power International.

AWS

AWS Machine Learning Analytics

Beyond data: Cloud analytics mastery for business brilliance

Dataconomy

SEPTEMBER 4, 2023

For instance, British Airways faced a fine of £183 million ($230 million) for a GDPR breach in 2018. Downtime, like the AWS outage in 2017 that affected several high-profile websites, can disrupt business operations. Cloud platforms like AWS, Azure, and Google Cloud offer scalable resources that can be provisioned on-demand.

Analytics

Analytics Big Data Analytics

Predicting new and existing product sales in semiconductors using Amazon Forecast

AWS Machine Learning Blog

APRIL 6, 2023

& AWS Machine Learning Solutions Lab (MLSL) Machine learning (ML) is being used across a wide range of industries to extract actionable insights from data to streamline processes and improve revenue generation. We trained three models using data from 2011–2018 and predicted the sales values until 2021.

Machine Learning

Machine Learning ML

Train self-supervised vision transformers on overhead imagery with Amazon SageMaker

AWS Machine Learning Blog

AUGUST 16, 2023

The images document the land cover, or physical surface features, of ten European countries between June 2017 and May 2018. Because we use true color images during DINO training, we only upload the red (B04), green (B03), and blue (B02) bands: aws s3 cp final_ben_s2.parquet Machine Learning Engineer at AWS. tif" --include "_B03.tif"

ML

ML AWS Data Scientist

Present and future of data cubes: an European EO perspective

Mlearning.ai

JANUARY 26, 2023

BUILDING EARTH OBSERVATION DATA CUBES ON AWS. 2018, July). In IGARSS 2018–2018 IEEE International Geoscience and Remote Sensing Symposium (pp. AWS , GCP , Azure , CreoDIAS , for example, are not open-source, nor are they “standard”. Big ones can: AWS is benefiting a lot from these concepts. Data, 4(3), 92.

AWS

AWS Database Clean Data Data Science

Accelerate your learning towards AWS Certification exams with automated quiz generation using Amazon SageMaker foundations models

AWS Machine Learning Blog

MAY 31, 2023

Getting AWS Certified can help you propel your career, whether you’re looking to find a new role, showcase your skills to take on a new project, or become your team’s go-to expert. Reading the FAQ page of the AWS services relevant for your certification exam is important in order to acquire a deeper understanding of the service.

AWS

AWS ML Python

Create high-quality datasets with Amazon SageMaker Ground Truth and FiftyOne

AWS Machine Learning Blog

MAY 5, 2023

This is a joint post co-written by AWS and Voxel51. To learn more about Ground Truth, refer to Label Data, Amazon SageMaker Data Labeling FAQs, and the AWS Machine Learning Blog. Voxel51 is the company behind FiftyOne, the open-source toolkit for building high-quality datasets and computer vision models. Join the FiftyOne community!

Machine Learning

Machine Learning AWS ML

Instruction fine-tuning for FLAN T5 XL with Amazon SageMaker Jumpstart

AWS Machine Learning Blog

MAY 22, 2023

Prerequisites: To get started, all you need is an AWS account in which you can use Studio. About the authors: Laurent Callot is a Principal Applied Scientist and manager at AWS AI Labs who has worked on a variety of machine learning problems, from foundational models and generative AI to forecasting, anomaly detection, causality, and AI Ops.

AWS

AWS Natural Language Processing Machine Learning

The journey of PGA TOUR’s generative AI virtual assistant, from concept to development to prototype

AWS Machine Learning Blog

MARCH 14, 2024

In this post we highlight how the AWS Generative AI Innovation Center collaborated with the AWS Professional Services and PGA TOUR to develop a prototype virtual assistant using Amazon Bedrock that could enable fans to extract information about any event, player, hole or shot level details in a seamless interactive manner.

SQL

SQL AWS AI

Federated Learning on AWS with FedML: Health analytics without sharing sensitive data – Part 2

AWS Machine Learning Blog

JANUARY 13, 2023

To mitigate these challenges, we propose a federated learning (FL) framework, based on open-source FedML on AWS, which enables analyzing sensitive HCLS data. In this two-part series, we demonstrate how you can deploy a cloud-based FL framework on AWS. For Account ID, enter the AWS account ID of the owner of the accepter VPC.

AWS

AWS Analytics Machine Learning

Question answering using Retrieval Augmented Generation with foundation models in Amazon SageMaker JumpStart

AWS Machine Learning Blog

MAY 2, 2023

There are a few limitations of using off-the-shelf pre-trained LLMs: They’re usually trained offline, making the model agnostic to the latest information (for example, a chatbot trained from 2011–2018 has no information about COVID-19). Managed Spot Training is supported in all AWS Regions where Amazon SageMaker is currently available.

Algorithm

Algorithm Machine Learning Natural Language Processing

Top 5 Generative AI Integration Companies to drive Customer Support in 2023

Chatbots Life

MAY 16, 2023

Master of Code Global (MOCG) is a certified partner of Microsoft and AWS and has been recognized by LivePerson, Inc. The Bot Forge. Year Founded: 2018. HQ: Buckinghamshire, UK. Team Size: 2–10 employees. Clients: Rossano Ferretti, Help For Heroes, BNP Paribas, Skin Check Champions, EcoATM, Customs Clearance Consortium.

AI

AI Natural Language Processing Artificial Intelligence

You’re not alone in the cyber battlefield

Dataconomy

SEPTEMBER 6, 2023

GDPR (General Data Protection Regulation) is a comprehensive data privacy regulation in the European Union (EU) that went into effect on May 25, 2018. The audit assesses the organization’s ISMS against the requirements of the standard, and if successful, the organization is issued a certificate of compliance. What is GDPR?

Azure

Azure AWS

Meet the 2021 Iron Viz Finalists, get your Supporter Kit and let the games begin

Tableau

OCTOBER 11, 2021

Pradeep Kumar G: I have been entering Iron Viz since 2018 (feeder 3). Pradeep Kumar G: When I started my Tableau journey in early 2018, Bird Strikes Redoux was the first viz I looked for inspiration from Tableau Public. Samuel Parsons: Aw, come on! It educated me that a minimalist design could convey a powerful story.

Tableau

Tableau Data Visualization AWS

How to use Netezza Performance Server query data in Amazon Simple Storage Service (S3)

IBM Journey to AI blog

JANUARY 10, 2023

This demonstration then compares the current flight delay data (January 2019 – June 2022) with historical flight delay data (June 2003 – December 2018) to understand if the flight delays experienced in 2022 are occurring with more frequency or simply following a historical pattern. Figure 1 – NPS database table definitions.

Data Warehouse

Data Warehouse Data Analysis SQL

Customizing coding companions for organizations

AWS Machine Learning Blog

NOVEMBER 9, 2023

In these two studies, commissioned by AWS, developers were asked to create a medical software application in Java that required use of their internal libraries. About the authors: Qing Sun is a Senior Applied Scientist in AWS AI Labs and works on AWS CodeWhisperer, a generative AI-powered coding assistant.

AWS

AWS Natural Language Processing K-nearest Neighbors AI

How to implement the General Data Protection Regulation (GDPR)

IBM Journey to AI blog

FEBRUARY 23, 2024

The General Data Protection Regulation (GDPR), the European Union’s landmark data privacy law, took effect in 2018. Learn how IBM Guardium® Data Protection automatically discovers, classifies, and protects sensitive data across major repositories like AWS, DBaaS, and on-premises mainframes. billion fine in 2023.

Data Governance

Data Governance AWS Database

Generate a counterfactual analysis of corn response to nitrogen with Amazon SageMaker JumpStart solutions

AWS Machine Learning Blog

APRIL 3, 2023

Prerequisites: You need an AWS account to use this solution. To run this JumpStart 1P Solution and have the infrastructure deployed to your AWS account, you need to create an active Amazon SageMaker Studio instance (refer to Onboard to Amazon SageMaker Domain).

Database

Database AWS Machine Learning

A Decade of Have I Been Pwned

Hacker News

DECEMBER 3, 2023

Back in 2018, Gizmodo reckoned HIBP was one of the top 100 websites that shaped the internet as we knew it, alongside the likes of Wikipedia, Google, Amazon and Goatse (don't Google it). aw man, thanks The Register! And then ensured could never happen again.

AWS

Amazon Textract’s new Layout feature introduces efficiencies in general purpose and generative AI document processing tasks

AWS Machine Learning Blog

NOVEMBER 21, 2023

Anjan is part of the worldwide AI services specialist team and works with customers to help them understand and develop solutions to business problems with AWS AI Services and generative AI. She is focused on building machine learning–based services for AWS customers. In her spare time, Lalita likes to play board games and go on hikes.

AI

AI Database AWS

Identifying defense coverage schemes in NFL’s Next Gen Stats

AWS Machine Learning Blog

FEBRUARY 10, 2023

Quantitative evaluation: We utilize 2018–2020 season data for model training and validation, and 2021 season data for model evaluation. Prior to AWS, he obtained his MCS from West Virginia University and worked as computer vision researcher at Midea. Each season consists of around 17,000 plays. She received her Ph.D.

ML

ML Machine Learning

Optimize your machine learning deployments with auto scaling on Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 8, 2023

About the Authors Mohan Gandhi is a Senior Software Engineer at AWS. He has been with AWS for the last 10 years and has worked on various AWS services like EMR, EFA and RDS. Venkatesh Krishnan leads Product Management for Amazon SageMaker in AWS. format(str(round(time.time()))) resource_id = "endpoint/{}/variant/{}".format(endpoint_name,

Machine Learning

Machine Learning ML

How HSR.health is limiting risks of disease spillover from animals to humans using Amazon SageMaker geospatial capabilities

AWS Machine Learning Blog

FEBRUARY 5, 2024

round(2) return(layer) The following figure on the left shows the aggregation of the image classification from the test area scene in northern Peru aggregated to the district administrative level with the calculated change in the forest area between 2018–2023. Emmett joined AWS in 2020 and is based in Austin, TX. min()) * 100).round(2)

ML

ML AWS Support Vector Machines

The history of Kubernetes

IBM Journey to AI blog

NOVEMBER 2, 2023

These tech pioneers were looking for ways to bring Google’s internal infrastructure expertise into the realm of large-scale cloud computing and also enable Google to compete with Amazon Web Services (AWS)—the unrivaled leader among cloud providers at the time.

Clustering

Clustering Cloud Computing AWS

spaCy meets Transformers: Fine-tune BERT, XLNet and GPT-2

Explosion

AUGUST 1, 2019

Based on the (fairly vague) marketing copy, AWS might be doing something similar in SageMaker. 2018) in using the vector for the class token to represent the sentence, and passing this vector forward into a softmax layer in order to perform classification. in accuracy depending on the task and dataset. and follows Devlin et al.

Natural Language Processing

Natural Language Processing AWS Machine Learning

$100M+ ARR: Alation Achieves Centaur Status

Alation

SEPTEMBER 30, 2022

Our ability to catalog every data asset means that we can partner with other ISVs in data quality and observability, like BigEye and Soda; privacy, like BigID and OneTrust; access governance, like Immuta and Privacera; not to mention the core platforms, like Snowflake, Databricks, AWS, GCP, and Azure.

Data Governance

Data Governance Azure SQL Data Quality

Cash Flows in Crypto

Ocean Protocol

AUGUST 14, 2023

In the current state of the web, businesses bear the costs of computing and storage provided by cloud computing services like Amazon Web Services (AWS). Trading Volume: Uniswap, since launching in 2018, has facilitated ~$1.5 Gas fees are paid by users of DApps to have their transactions computed and stored on the Ethereum ledger.

Cloud Computing

Cloud Computing Data Scientist AWS AI

A Guide to Data Analytics in the Travel Industry

Alation

MARCH 21, 2023

When it embarked on a digital transformation and modernization initiative in 2018, the company migrated all its data to AWS S3 Data Lake and Snowflake Data Cloud to provide accessibility to data to all users. Using Alation, ARC automated the data curation and cataloging process.

Analytics

Analytics Data Silos Big Data

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

AWS Machine Learning Blog

JANUARY 17, 2024

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. In this post, we demonstrate how to deploy and fine-tune Llama 2 on Trainium and AWS Inferentia instances in SageMaker JumpStart.

AWS

AWS Python Machine Learning

Generative AI in the Enterprise

O'Reilly Media

NOVEMBER 28, 2023

How will AI adopters react when the cost of renting infrastructure from AWS, Microsoft, or Google rises? We haven’t found the source, though in 2018, Gartner wrote that 85% of AI projects “deliver erroneous outcomes.” That’s not the same as failure, and 2018 significantly predates generative AI.

AI

AI Data Analysis

Harness large language models in fake news detection

AWS Machine Learning Blog

NOVEMBER 14, 2023

The solution also uses Amazon Bedrock, a fully managed service that makes foundation models (FMs) from Amazon and third-party model providers accessible through the AWS Management Console and APIs. or higher installed on either Linux, Mac, or a Windows Subsystem for Linux and an AWS account.

Computer Science

Computer Science AWS Python
