March, 2020

article thumbnail

10 Awesome Data Manipulation and Wrangling Hacks, Tips and Tricks

Analytics Vidhya

Introduction “Efficiency is doing things right. Effectiveness is doing the right thing.” – Zig Zagler As data scientists, we are often taught to be. The post 10 Awesome Data Manipulation and Wrangling Hacks, Tips and Tricks appeared first on Analytics Vidhya.

article thumbnail

20+ Machine Learning Datasets & Project Ideas

KDnuggets

Upgrading your machine learning, AI, and Data Science skills requires practice. To practice, you need to develop models with a large amount of data. Finding good datasets to work with can be challenging, so this article discusses more than 20 great datasets along with machine learning project ideas for you to tackle today.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Calling the global data science community to #HACKCORONA

Dataconomy

COVID-19 is still spreading exponentially throughout the world. Current statistics indicate that 15-20% of people who get it require hospitalization for respiratory failure for multiple weeks. The hardship falls on elderly people, medical personnel as well as the healthcare system in general. Identifying the main pain points in the current. The post Calling the global data science community to #HACKCORONA appeared first on Dataconomy.

article thumbnail

Azure has the most Cloud Regions, and it's not even close

Data Science 101

The big cloud providers are expanding globally by adding more Global Regions. Google recently announced a new mountain west region. Plus, all the other providers have plans to expand globally. This got me wondering, which provider has the most global regions. I went to all the big cloud provider websites, and I was a bit surprised with the results. Google Cloud Regions.

Azure 145
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

5 Ingenious Ways To Use Big Data For Customer Engagement

Smart Data Collective

Big data is changing the direction of our economy in unprecedented ways. Every business should look for ways to monetize big data and use it to optimize your business model. The number of companies using big data is growing at an accelerated rate. One poll found that 53% of businesses were using big data analytics in 2017. This figure has presumably risen in the years since.

Big Data 142
article thumbnail

Predicting COVID-19 on the U.S. County Level

DataRobot

With the fight against COVID-19 spreading across the U.S. and the world, DataRobot understands it is essential that federal government entities convey accurate information to citizens, local governments, and healthcare providers. Towards that end, DataRobot’s enterprise AI platform has developed models to predict which U.S. counties are likely to have their first confirmed COVID-19 cases in the next five days.

AI 132

More Trending

article thumbnail

Coronavirus Data and Poll Analysis – yes, there is hope, if we act now

KDnuggets

We examine the growth of coronavirus daily cases in most affected countries, and show evidence that social distancing works in reducing the rate of spread. We also analyze KDnuggets Poll results - the scale of change to online and how Data Science work is likely to increase or drop in different regions. Stay Healthy and practice social distancing!

article thumbnail

HackCorona: 300 participants, 41 nationalities, 23 solutions to fight COVID-19 outbreak

Dataconomy

In just one day, the HackCorona initiative gathered over 1700 people and 300 selected hackers came up with 23 digital solutions to help the world fight the COVID-19 outbreak during the 48-hour long virtual hackathon by Data Natives and Hacking Health. Here are the results. HackCorona was created on March 17th. The post HackCorona: 300 participants, 41 nationalities, 23 solutions to fight COVID-19 outbreak appeared first on Dataconomy.

Big Data 231
article thumbnail

Hilary Mason – The Future of AI and Machine Learning

Data Science 101

Hilary Mason is the Founder of Fast Forward Labs. She has been involved in the data science space for over a decade. She is a real thought leader in the data space. This keynote was delivered at ODSC East 2020. The post Hilary Mason – The Future of AI and Machine Learning appeared first on Data Science 101.

article thumbnail

How Insurance Companies Use Data To Measure Risk And Choose Rates

Smart Data Collective

The auto insurance industry has always relied on data analysis to inform their policies and determine individual rates. With the technology available today, there’s even more data to draw from. The good news is that this new data can help lower your insurance rate. Here is the type of data insurance companies use to measure a client’s potential risk and determine rates.

article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

Writing is Learning: How I Learned an Easier Way to Write

Eugene Yan

Writing begins before actually writing; it's a cycle of reading -> note-taking -> writing.

130
130
article thumbnail

TensorFlow 2.0 Tutorial for Deep Learning

Analytics Vidhya

TensorFlow 2.0 – a Major Update for the Deep Learning Community Just when I thought TensorFlow’s market share would be eaten by the emergence. The post TensorFlow 2.0 Tutorial for Deep Learning appeared first on Analytics Vidhya.

article thumbnail

Time Series Classification Synthetic vs Real Financial Time Series

KDnuggets

This article discusses distinguishing between real financial time series and synthetic time series using XGBoost.

399
399
article thumbnail

Why Data Scientists Must Be Able to Explain Their Algorithms

Dataconomy

The models you create have real-world applications that affect how your colleagues do their jobs. That means they need to understand what you’ve created, how it works, and what its limitations are. They can’t do any of these things if it’s all one big mystery they don’t understand. “I’m afraid. The post Why Data Scientists Must Be Able to Explain Their Algorithms appeared first on Dataconomy.

article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

Elements of Data Science – A free Jupyter Notebook Textbook

Data Science 101

Elements of Data Science by Allen Downey is a freely available textbook. It consists of Jupyter Notebooks on Google Colab, so you can view and edit code if you want. This could be a great way to begin your data science and programming journey. The post Elements of Data Science – A free Jupyter Notebook Textbook appeared first on Data Science 101.

article thumbnail

Reasons For Transitioning To Cloud Computing In 2020

Smart Data Collective

Cloud computing has now become a common term that all of us have heard of. However, unfortunately, many of us still don’t understand the complete potential of cloud computing. It is high time for all us to understand how it can make our lives easier. Instead of storing data on a computer or hard drive , cloud computing stores programs and data over the internet.

article thumbnail

Be Humble: Black Swans and the Limits of Inductive Reasoning

DataRobot

After a decade of relative economic stability, we are now confronted by the COVID-19 pandemic, with many financial analysts labelling it as a ‘black swan’ event. A ‘black swan’ is a metaphor for something unexpected which has a major impact. These type of events can cause significant disruption to business processes, financial markets, and our lives.

AI 112
article thumbnail

Getting into Deep Learning? Here are 5 Things you Should Absolutely Know

Analytics Vidhya

Starting your Deep Learning Career? Deep learning can be a complex and daunting field for newcomers. Concepts like hidden layers, convolutional neural networks, backpropagation. The post Getting into Deep Learning? Here are 5 Things you Should Absolutely Know appeared first on Analytics Vidhya.

article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

article thumbnail

The 4 Best Jupyter Notebook Environments for Deep Learning

KDnuggets

Many cloud providers, and other third-party services, see the value of a Jupyter notebook environment which is why many companies now offer cloud hosted notebooks that are hosted on the cloud. Let's have a look at 3 such environments.

article thumbnail

The coronavirus shows us how tech companies can do more against fake news

Dataconomy

The spread of the coronavirus has become a rare event in which the entire world is affected and concerned. Open the newspaper, Facebook, or talk to literally anyone and the virus is the first topic that pops up. Initially, you might have thought the virus was just flu. Not a. The post The coronavirus shows us how tech companies can do more against fake news appeared first on Dataconomy.

article thumbnail

Emily Glassberg Sands – How Data Science Can Unlock Teaching & Learning at Scale

Data Science 101

Emily Glassberg Sands is the Head of Data Science at Coursera. This is a nice talk about how Coursera uses data science to improve the scale of teaching and learning. This talk was delivered at Women in Data Science 2020. The post Emily Glassberg Sands – How Data Science Can Unlock Teaching & Learning at Scale appeared first on Data Science 101.

article thumbnail

How Big Data Has Revolutionized the Gaming Industry

Smart Data Collective

Big data is driving a number of changes in our lives. Forbes recently wrote an article about the impact of big data on the food and hospitality industry. However, other sectors are changing as well. Big data phenomenon has revolutionized almost every aspect of an average citizen’s life. Information about our online activity has been accumulating for years, and now is actively used to know more about us.

Big Data 131
article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?

article thumbnail

AI Simplified: What Makes a Good Machine Learning Use Case?

DataRobot

You’ve heard about how AI and machine learning are transforming businesses across industries and around the world, but you don’t know how they’ve done it or even where to start. Figuring out the right problems to solve with machine learning can be a daunting exercise. Where are the good opportunities to leverage AI within your organization? What problems are best solved with AI?

article thumbnail

Build a Decision Tree in Minutes using Weka (No Coding Required!)

Analytics Vidhya

Learn how to build a decision tree model using Weka This tutorial is perfect for newcomers to machine learning and decision trees, and those. The post Build a Decision Tree in Minutes using Weka (No Coding Required!) appeared first on Analytics Vidhya.

article thumbnail

Covid-19, your community, and you — a data science perspective

KDnuggets

Let's talk about covid-19; the reality, the numbers, and the data science.

article thumbnail

How to Stop Fetishizing AI

Dataconomy

Our misguided perceptions of AI confuse the vital public debate about AI’s role in society by mitigating its severity and exaggerating its impact. Artificial Intelligence is sexy. It’s been able to translate between languages, recommend us new TV shows to watch, and beat humans at everything from Go to Jeopardy. . The post How to Stop Fetishizing AI appeared first on Dataconomy.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Google Video – Rules of Machine Learning

Data Science 101

To be great with machine learning, it helps to be a great engineer. That means doing the following: write simple code make it readable comment it fix the ever present sign mistake leverage peer review and version control track performance launch and iterate. Those are the general rules for software engineering, this video contains some specific rules for software with machine learning.

article thumbnail

3 Industries Adapting to Major AI Advances in 2020

Smart Data Collective

The market for AI is changing in spectacular ways. It is estimated that the market for artificial intelligence is going to be worth nearly $400 billion by the year 2025. Some industries are driving growth for AI in impressive ways. This is having some major changes on our everyday lives, as well as the operations of many businesses. These days, it seems that human life is becoming more and more intertwined with Artificial Intelligence.

article thumbnail

DataRobot Named One Of CB Insights’ 100 Most Innovative AI Startups

DataRobot

For the fourth year in a row, we’ve been named one of the world’s most innovative artificial intelligence startups by CB Insights for our pioneering approach to democratizing enterprise AI. CB Insights’ AI 100 list recognizes the most promising and impressive private AI companies worldwide.

article thumbnail

spaCy Tutorial to Learn and Master Natural Language Processing (NLP)

Analytics Vidhya

Introduction spaCy is my go-to library for Natural Language Processing (NLP) tasks. I’d venture to say that’s the case for the majority of NLP. The post spaCy Tutorial to Learn and Master Natural Language Processing (NLP) appeared first on Analytics Vidhya.

article thumbnail

Apache Airflow® Best Practices: DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!