2019

article thumbnail

An Exhaustive Guide to Detecting and Fighting Neural Fake News using NLP

Analytics Vidhya

Overview Neural fake news (fake news generated by AI) can be a huge issue for our society This article discusses different Natural Language Processing. The post An Exhaustive Guide to Detecting and Fighting Neural Fake News using NLP appeared first on Analytics Vidhya.

article thumbnail

Open Source Projects by Google, Uber and Facebook for Data Science and AI

KDnuggets

Open source is becoming the standard for sharing and improving technology. Some of the largest organizations in the world namely: Google, Facebook and Uber are open sourcing their own technologies that they use in their workflow to the public.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

5 Weird and Hilarious Uses of Data Science

Analytics Vidhya

Introduction “Ripley’s Believe or Not” features some of the weirdest and most bizarre facts from around the world. How about creating our own Ripley’s. The post 5 Weird and Hilarious Uses of Data Science appeared first on Analytics Vidhya.

article thumbnail

Getting Started with Automated Text Summarization

KDnuggets

This article will walk through an extractive text summarization process, using a simple word frequency approach, implemented in Python.

Python 400
article thumbnail

Precision in Motion: Why Process Optimization Is the Future of Manufacturing

Speaker: Jason Chester, Director, Product Management

In today’s manufacturing landscape, staying competitive means moving beyond reactive quality checks and toward real-time, data-driven process control. But what does true manufacturing process optimization look like—and why is it more urgent now than ever? Join Jason Chester in this new, thought-provoking session on how modern manufacturers are rethinking quality operations from the ground up.

article thumbnail

10 Best and Free Machine Learning Courses, Online

KDnuggets

Getting ready to leap into the world of Data Science? Consider these top machine learning courses curated by experts to help you learn and thrive in this exciting field.

article thumbnail

A Doomed Marriage of Machine Learning and Agile

KDnuggets

Sebastian Thrun, the founder of Udacity, ruined my machine learning project and wedding.

More Trending

article thumbnail

5 Key Reasons Why Data Scientists Are Quitting their Jobs

Analytics Vidhya

Introduction The stock of a data scientist is at an all-time high right now. There aren’t too many professions out there that can rival. The post 5 Key Reasons Why Data Scientists Are Quitting their Jobs appeared first on Analytics Vidhya.

article thumbnail

Random Forest vs Neural Networks for Predicting Customer Churn

KDnuggets

Let us see how random forest competes with neural networks for solving a real world business problem.

Analytics 392
article thumbnail

Data Science Curriculum Roadmap

KDnuggets

What follows is a set of broad recommendations, and it will inevitably require a lot of adjustments in each implementation. Given that caveat, here are our curriculum recommendations.

article thumbnail

Market Basket Analysis: A Tutorial

KDnuggets

This article is about Market Basket Analysis & the Apriori algorithm that works behind it.

Algorithm 381
article thumbnail

Airflow Best Practices for ETL/ELT Pipelines

Speaker: Kenten Danas, Senior Manager, Developer Relations

ETL and ELT are some of the most common data engineering use cases, but can come with challenges like scaling, connectivity to other systems, and dynamically adapting to changing data sources. Airflow is specifically designed for moving and transforming data in ETL/ELT pipelines, and new features in Airflow 3.0 like assets, backfills, and event-driven scheduling make orchestrating ETL/ELT pipelines easier than ever!

article thumbnail

What is the most important question for Data Science (and Digital Transformation)

KDnuggets

With so many buzzwords surrounding AI and machine learning, understanding which can bring business value and which are best left in the lab to mature is difficult. While machine learning offers significant power in driving digital transformations, a business must start with the right questions and leave the math to the development teams.

article thumbnail

6 Challenging Open Source Data Science Projects to Make you a Better Data Scientist

Analytics Vidhya

Overview Here are 6 challenging open-source data science projects to level up your data scientist skillset There are some intriguing data science projects, including. The post 6 Challenging Open Source Data Science Projects to Make you a Better Data Scientist appeared first on Analytics Vidhya.

article thumbnail

Explainability: Cracking open the black box, Part 1

KDnuggets

What is Explainability in AI and how can we leverage different techniques to open the black box of AI and peek inside? This practical guide offers a review and critique of the various techniques of interpretability.

AI 357
article thumbnail

Interpretability part 3: LIME and SHAP

KDnuggets

The third part in a series on leveraging techniques to take a look inside the black box of AI, this guide considers methods that try to explain each prediction instead of establishing a global explanation.

AI 357
article thumbnail

Whats New in Apache Airflow 3.0 –– And How Will It Reshape Your Data Workflows?

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

The 4 fastest ways not to get hired as a data scientist

KDnuggets

Ready to try to get hired as a data scientist for the first time? Avoiding these common mistakes won’t guarantee an offer, but not avoiding them is a sure fire way for your application to be tossed into the trash bin.

article thumbnail

Top KDnuggets tweets, Nov 20-26: How to Speed up Pandas by 4x with one line of code

KDnuggets

Also: Deep Learning for Image Classification with Less Data; How to Speed up Pandas by 4x with one line of code; 25 Useful #Python Snippets to Help in Your Day-to-Day Work; Automated Machine Learning Project Implementation Complexities.

article thumbnail

What is a Data Scientist Worth?

KDnuggets

What is the Salary of a Data Scientist in 2019? Let's have a look at some data to see how we can answer that question.

article thumbnail

Automatic Text Summarization in a Nutshell

KDnuggets

Marketing scientist Kevin Gray asks Dr. Anna Farzindar of the University of Southern California about Automatic Text Summarization and the various ways it is used.

Analytics 333
article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

The Essential Toolbox for Data Cleaning

KDnuggets

Increase your confidence to perform data cleaning with a broader perspective of what datasets typically look like, and follow this toolbox of code snipets to make your data cleaning process faster and more efficient.

article thumbnail

10 Exciting Real-World Applications of AI in Retail

Analytics Vidhya

Overview The rise of artificial intelligence (AI) has disrupted many industries in recent years One of the most impacted industries – retail! Retail operations. The post 10 Exciting Real-World Applications of AI in Retail appeared first on Analytics Vidhya.

article thumbnail

How to use a Machine Learning Model to Make Predictions on Streaming Data using PySpark

Analytics Vidhya

Overview Streaming data is a thriving concept in the machine learning space Learn how to use a machine learning model (such as logistic regression). The post How to use a Machine Learning Model to Make Predictions on Streaming Data using PySpark appeared first on Analytics Vidhya.

article thumbnail

What is the Chi-Square Test and How Does it Work? An Intuitive Explanation with R Code

Analytics Vidhya

Overview What is the chi-square test? How does it work? Learn about the different types of Chi-Square tests and where and when you should. The post What is the Chi-Square Test and How Does it Work? An Intuitive Explanation with R Code appeared first on Analytics Vidhya.

Analytics 319
article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

article thumbnail

Alternative Cloud Hosted Data Science Environments

KDnuggets

Over the years new alternative providers have risen to provided a solitary data science environment hosted on the cloud for data scientist to analyze, host and share their work.

article thumbnail

A Non-Technical Reading List for Data Science

KDnuggets

The world still cannot be reduced to numbers on a page because human beings are still the ones making all the decisions. So, the best data scientists understand the numbers and the people. Check out these great data science books that will make you a better data scientist without delving into the technical details.

article thumbnail

Game (Theory) for AI? An Illustrated Guide for Everyone

Analytics Vidhya

Overview What is Game Theory? And how does it apply to artificial intelligence (AI)? Game theory for AI is a fascinating concept that we. The post Game (Theory) for AI? An Illustrated Guide for Everyone appeared first on Analytics Vidhya.

article thumbnail

10 Free Must-read Books on AI

KDnuggets

Artificial Intelligence continues to fill the media headlines while scientists and engineers rapidly expand its capabilities and applications. With such explosive growth in the field, there is a great deal to learn. Dive into these 10 free books that are must-reads to support your AI study and work.

article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?

article thumbnail

Plotnine: Python Alternative to ggplot2

KDnuggets

Python's plotting libraries such as matplotlib and seaborn does allow the user to create elegant graphics as well, but lack of a standardized syntax for implementing the grammar of graphics compared to the simple, readable and layering approach of ggplot2 in R makes it more difficult to implement in Python.

Python 307
article thumbnail

How to Speed up Pandas by 4x with one line of code

KDnuggets

While Pandas is the library for data processing in Python, it isn't really built for speed. Learn more about the new library, Modin, developed to distribute Pandas' computation to speedup your data prep.

Python 307
article thumbnail

How YouTube is Recommending Your Next Video

KDnuggets

If you are interested in learning more about the latest Youtube recommendation algorithm paper, read this post for details on its approach and improvements.

Algorithm 308
article thumbnail

Everything a Data Scientist Should Know About Data Management

KDnuggets

For full-stack data science mastery, you must understand data management along with all the bells and whistles of machine learning. This high-level overview is a road map for the history and current state of the expansive options for data storage and infrastructure solutions.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.