
No "Zero-Shot" Without Exponential Data

Hacker News

Web-crawled pretraining datasets underlie the impressive "zero-shot" evaluation performance of multimodal models, such as CLIP for classification/retrieval and Stable-Diffusion for image generation.
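As a rough illustration of what "zero-shot" classification with CLIP looks like in practice, here is a minimal sketch using the Hugging Face transformers API; the checkpoint name, image path, and candidate labels are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch of CLIP-style zero-shot classification (illustrative only).
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

checkpoint = "openai/clip-vit-base-patch32"  # assumed checkpoint, not from the article
model = CLIPModel.from_pretrained(checkpoint)
processor = CLIPProcessor.from_pretrained(checkpoint)

image = Image.open("example.jpg")  # placeholder image path
labels = ["a photo of a dog", "a photo of a cat", "a photo of a car"]

inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs)

# Image-text similarity logits -> probabilities over the candidate labels.
probs = outputs.logits_per_image.softmax(dim=-1)
for label, p in zip(labels, probs[0].tolist()):
    print(f"{label}: {p:.3f}")
```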


A defined process for project post mortem review (1996)

Hacker News

The authors propose a tentative, standard process for conducting post mortem reviews and describe activities, roles, and artifacts of the process. Participants are empowered when they know that each issue raised during the post mortem process must be added to the risk database and evaluated methodically on each subsequent project.


Trending Sources


Are Model Explanations Useful in Practice? Rethinking How to Support Human-ML Interactions.

ML @ CMU

Our work further motivates novel directions for developing and evaluating tools to support human-ML interactions. For example, explanations are thought to assist model developers in identifying when models rely on spurious artifacts and to aid domain experts in determining whether to follow a model’s prediction.
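As a rough, generic illustration of that first use case (not code from the CMU post), the sketch below trains a classifier on synthetic data containing a deliberately spurious feature and uses permutation importance, a simple global explanation, to surface the model's reliance on it.

```python
# Hedged sketch: using a global explanation to spot reliance on a spurious feature.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance

rng = np.random.default_rng(0)
n = 1000
signal = rng.normal(size=n)                               # the feature we want the model to use
spurious = (signal > 0) + rng.normal(scale=0.1, size=n)   # an artifact that leaks the label
noise = rng.normal(size=n)
X = np.column_stack([signal, spurious, noise])
y = (signal > 0).astype(int)

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# If "spurious" dominates, the developer learns the model is leaning on the
# artifact rather than the intended signal.
result = permutation_importance(model, X, y, n_repeats=10, random_state=0)
for name, importance in zip(["signal", "spurious", "noise"], result.importances_mean):
    print(f"{name}: {importance:.3f}")
```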


How VMware built an MLOps pipeline from scratch using GitLab, Amazon MWAA, and Amazon SageMaker

Flipboard

It is critical for the VMware Carbon Black team to design and build a custom end-to-end MLOps pipeline that orchestrates and automates workflows in the ML lifecycle and enables model training, evaluations, and deployments. The following architecture diagram illustrates the end-to-end workflow and the components involved in our MLOps pipeline.
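For readers unfamiliar with the pattern, the sketch below shows a generic Amazon MWAA (Apache Airflow 2.x with the Amazon provider package) DAG that launches a SageMaker training job. It is not VMware's actual pipeline; the DAG ID, container image, role ARN, and S3 paths are placeholders.

```python
# Hedged sketch of an Airflow DAG (as run on Amazon MWAA) that starts a SageMaker
# training job. All names, ARNs, and S3 URIs below are placeholders.
from datetime import datetime

from airflow import DAG
from airflow.providers.amazon.aws.operators.sagemaker import SageMakerTrainingOperator

training_config = {
    "TrainingJobName": "example-train-{{ ds_nodash }}",  # templated per DAG run
    "AlgorithmSpecification": {
        "TrainingImage": "123456789012.dkr.ecr.us-east-1.amazonaws.com/train:latest",
        "TrainingInputMode": "File",
    },
    "RoleArn": "arn:aws:iam::123456789012:role/SageMakerExecutionRole",
    "InputDataConfig": [{
        "ChannelName": "train",
        "DataSource": {"S3DataSource": {
            "S3DataType": "S3Prefix",
            "S3Uri": "s3://my-bucket/train/",
            "S3DataDistributionType": "FullyReplicated",
        }},
    }],
    "OutputDataConfig": {"S3OutputPath": "s3://my-bucket/output/"},
    "ResourceConfig": {
        "InstanceType": "ml.m5.xlarge",
        "InstanceCount": 1,
        "VolumeSizeInGB": 50,
    },
    "StoppingCondition": {"MaxRuntimeInSeconds": 3600},
}

with DAG(
    dag_id="mlops_training_pipeline",
    start_date=datetime(2023, 1, 1),
    schedule_interval=None,  # triggered externally (e.g., by a CI job), not by a cron
    catchup=False,
) as dag:
    train = SageMakerTrainingOperator(
        task_id="train_model",
        config=training_config,
        wait_for_completion=True,  # block until the training job finishes
    )
```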


Demystifying deepfake videos: The powerful fusion of technology and data science

Data Science Dojo

Synthetic Artifacts: Look for strange artifacts or distortions in the video, such as unnatural lighting, inconsistent shadows, or pixelation. Audio Discrepancies: With the rise of audio deepfakes, it is essential to consider auditory cues when evaluating media authenticity. These anomalies can help identify potential fakes.


Streamline diarization using AI as an assistive technology: ZOO Digital’s story

AWS Machine Learning Blog

In this collaboration, we deployed and evaluated WhisperX on SageMaker, using an asynchronous inference endpoint to host the model. In the following sections, we delve into the details of deploying the WhisperX model on SageMaker and evaluate the diarization performance.
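As a rough sketch of the hosting pattern described here (not ZOO Digital's actual client code), the following shows how a SageMaker asynchronous inference endpoint is typically invoked with boto3; the endpoint name and S3 locations are placeholders.

```python
# Hedged sketch: invoking a SageMaker asynchronous inference endpoint.
import time
import boto3

runtime = boto3.client("sagemaker-runtime")
s3 = boto3.client("s3")

# Async inference reads the request payload from S3 rather than the request body.
response = runtime.invoke_endpoint_async(
    EndpointName="whisperx-async-endpoint",             # placeholder endpoint name
    InputLocation="s3://my-bucket/audio/episode.json",  # placeholder payload location
    ContentType="application/json",
)

# The call returns immediately; the result lands at OutputLocation when ready.
output_uri = response["OutputLocation"]
bucket, key = output_uri.replace("s3://", "").split("/", 1)

while True:
    try:
        body = s3.get_object(Bucket=bucket, Key=key)["Body"].read()
        print(body.decode("utf-8"))  # diarized output; format depends on the model handler
        break
    except s3.exceptions.NoSuchKey:
        time.sleep(10)  # still processing
```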
