March, 2020

article thumbnail

10 Awesome Data Manipulation and Wrangling Hacks, Tips and Tricks

Analytics Vidhya

Introduction “Efficiency is doing things right. Effectiveness is doing the right thing.” – Zig Zagler As data scientists, we are often taught to be. The post 10 Awesome Data Manipulation and Wrangling Hacks, Tips and Tricks appeared first on Analytics Vidhya.

article thumbnail

20+ Machine Learning Datasets & Project Ideas

KDnuggets

Upgrading your machine learning, AI, and Data Science skills requires practice. To practice, you need to develop models with a large amount of data. Finding good datasets to work with can be challenging, so this article discusses more than 20 great datasets along with machine learning project ideas for you to tackle today.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Calling the global data science community to #HACKCORONA

Dataconomy

COVID-19 is still spreading exponentially throughout the world. Current statistics indicate that 15-20% of people who get it require hospitalization for respiratory failure for multiple weeks. The hardship falls on elderly people, medical personnel as well as the healthcare system in general. Identifying the main pain points in the current. The post Calling the global data science community to #HACKCORONA appeared first on Dataconomy.

article thumbnail

What is the most effective policy response to the new coronavirus pandemic?

Machine Learning (Theory)

Disclaimer: I am not an epidemiologist, but there is an interesting potentially important pattern in the data that seems worth understanding. World healthcare authorities appear to be primarily shifting towards Social Distancing. However, there is potential to pursue a different strategy in the medium term that exploits a vulnerability of this disease: the 5 day incubation time is much longer than a 4 hour detection time.

130
130
article thumbnail

Navigating the Future: Generative AI, Application Analytics, and Data

Generative AI is upending the way product developers & end-users alike are interacting with data. Despite the potential of AI, many are left with questions about the future of product development: How will AI impact my business and contribute to its success? What can product managers and developers expect in the future with the widespread adoption of AI?

article thumbnail

Azure has the most Cloud Regions, and it's not even close

Data Science 101

The big cloud providers are expanding globally by adding more Global Regions. Google recently announced a new mountain west region. Plus, all the other providers have plans to expand globally. This got me wondering, which provider has the most global regions. I went to all the big cloud provider websites, and I was a bit surprised with the results. Google Cloud Regions.

Azure 145
article thumbnail

Predicting COVID-19 on the U.S. County Level

DataRobot

With the fight against COVID-19 spreading across the U.S. and the world, DataRobot understands it is essential that federal government entities convey accurate information to citizens, local governments, and healthcare providers. Towards that end, DataRobot’s enterprise AI platform has developed models to predict which U.S. counties are likely to have their first confirmed COVID-19 cases in the next five days.

AI 132

More Trending

article thumbnail

Coronavirus Data and Poll Analysis – yes, there is hope, if we act now

KDnuggets

We examine the growth of coronavirus daily cases in most affected countries, and show evidence that social distancing works in reducing the rate of spread. We also analyze KDnuggets Poll results - the scale of change to online and how Data Science work is likely to increase or drop in different regions. Stay Healthy and practice social distancing!

article thumbnail

HackCorona: 300 participants, 41 nationalities, 23 solutions to fight COVID-19 outbreak

Dataconomy

In just one day, the HackCorona initiative gathered over 1700 people and 300 selected hackers came up with 23 digital solutions to help the world fight the COVID-19 outbreak during the 48-hour long virtual hackathon by Data Natives and Hacking Health. Here are the results. HackCorona was created on March 17th. The post HackCorona: 300 participants, 41 nationalities, 23 solutions to fight COVID-19 outbreak appeared first on Dataconomy.

Big Data 231
article thumbnail

Writing is Learning: How I Learned an Easier Way to Write

Eugene Yan

Writing begins before actually writing; it's a cycle of reading -> note-taking -> writing.

130
130
article thumbnail

Hilary Mason – The Future of AI and Machine Learning

Data Science 101

Hilary Mason is the Founder of Fast Forward Labs. She has been involved in the data science space for over a decade. She is a real thought leader in the data space. This keynote was delivered at ODSC East 2020. The post Hilary Mason – The Future of AI and Machine Learning appeared first on Data Science 101.

article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

5 Ingenious Ways To Use Big Data For Customer Engagement

Smart Data Collective

Big data is changing the direction of our economy in unprecedented ways. Every business should look for ways to monetize big data and use it to optimize your business model. The number of companies using big data is growing at an accelerated rate. One poll found that 53% of businesses were using big data analytics in 2017. This figure has presumably risen in the years since.

Big Data 113
article thumbnail

TensorFlow 2.0 Tutorial for Deep Learning

Analytics Vidhya

TensorFlow 2.0 – a Major Update for the Deep Learning Community Just when I thought TensorFlow’s market share would be eaten by the emergence. The post TensorFlow 2.0 Tutorial for Deep Learning appeared first on Analytics Vidhya.

article thumbnail

The 4 Best Jupyter Notebook Environments for Deep Learning

KDnuggets

Many cloud providers, and other third-party services, see the value of a Jupyter notebook environment which is why many companies now offer cloud hosted notebooks that are hosted on the cloud. Let's have a look at 3 such environments.

article thumbnail

Why Data Scientists Must Be Able to Explain Their Algorithms

Dataconomy

The models you create have real-world applications that affect how your colleagues do their jobs. That means they need to understand what you’ve created, how it works, and what its limitations are. They can’t do any of these things if it’s all one big mystery they don’t understand. “I’m afraid. The post Why Data Scientists Must Be Able to Explain Their Algorithms appeared first on Dataconomy.

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

Be Humble: Black Swans and the Limits of Inductive Reasoning

DataRobot

After a decade of relative economic stability, we are now confronted by the COVID-19 pandemic, with many financial analysts labelling it as a ‘black swan’ event. A ‘black swan’ is a metaphor for something unexpected which has a major impact. These type of events can cause significant disruption to business processes, financial markets, and our lives.

AI 112
article thumbnail

Elements of Data Science – A free Jupyter Notebook Textbook

Data Science 101

Elements of Data Science by Allen Downey is a freely available textbook. It consists of Jupyter Notebooks on Google Colab, so you can view and edit code if you want. This could be a great way to begin your data science and programming journey. The post Elements of Data Science – A free Jupyter Notebook Textbook appeared first on Data Science 101.

article thumbnail

How Insurance Companies Use Data To Measure Risk And Choose Rates

Smart Data Collective

The auto insurance industry has always relied on data analysis to inform their policies and determine individual rates. With the technology available today, there’s even more data to draw from. The good news is that this new data can help lower your insurance rate. Here is the type of data insurance companies use to measure a client’s potential risk and determine rates.

article thumbnail

spaCy Tutorial to Learn and Master Natural Language Processing (NLP)

Analytics Vidhya

Introduction spaCy is my go-to library for Natural Language Processing (NLP) tasks. I’d venture to say that’s the case for the majority of NLP. The post spaCy Tutorial to Learn and Master Natural Language Processing (NLP) appeared first on Analytics Vidhya.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

What is the most effective policy response to the new coronavirus pandemic?

KDnuggets

Where Test/Trace/Quarantine are working, the number of cases/day have declined empirically. Furthermore, this appears to be a radically superior strategy where it can be deployed. I’ll review the evidence, discuss the other strategies and their consequences, and then discuss what can be done.

370
370
article thumbnail

The coronavirus shows us how tech companies can do more against fake news

Dataconomy

The spread of the coronavirus has become a rare event in which the entire world is affected and concerned. Open the newspaper, Facebook, or talk to literally anyone and the virus is the first topic that pops up. Initially, you might have thought the virus was just flu. Not a. The post The coronavirus shows us how tech companies can do more against fake news appeared first on Dataconomy.

article thumbnail

AI in Turbulent Times: Navigating Changing Conditions Webinar

DataRobot

Data science teams are scrambling to update their models in the wake of extreme and unforeseen worldwide changes brought on by the global COVID-19 pandemic. In the face of these unprecedented events, one of the key concerns among many data scientists is that their current models could be generating inaccurate or misleading predictions. In the webinar, AI in Turbulent Times: Navigating Changing Conditions , we outlined the steps that data scientists can take to incorporate robustness into their m

article thumbnail

Emily Glassberg Sands – How Data Science Can Unlock Teaching & Learning at Scale

Data Science 101

Emily Glassberg Sands is the Head of Data Science at Coursera. This is a nice talk about how Coursera uses data science to improve the scale of teaching and learning. This talk was delivered at Women in Data Science 2020. The post Emily Glassberg Sands – How Data Science Can Unlock Teaching & Learning at Scale appeared first on Data Science 101.

article thumbnail

Manufacturing Sustainability Surge: Your Guide to Data-Driven Energy Optimization & Decarbonization

Speaker: Kevin Kai Wong, President of Emergent Energy Solutions

In today's industrial landscape, the pursuit of sustainable energy optimization and decarbonization has become paramount. Manufacturing corporations across the U.S. are facing the urgent need to align with decarbonization goals while enhancing efficiency and productivity. Unfortunately, the lack of comprehensive energy data poses a significant challenge for manufacturing managers striving to meet their targets.

article thumbnail

Reasons For Transitioning To Cloud Computing In 2020

Smart Data Collective

Cloud computing has now become a common term that all of us have heard of. However, unfortunately, many of us still don’t understand the complete potential of cloud computing. It is high time for all us to understand how it can make our lives easier. Instead of storing data on a computer or hard drive , cloud computing stores programs and data over the internet.

article thumbnail

Build a Decision Tree in Minutes using Weka (No Coding Required!)

Analytics Vidhya

Learn how to build a decision tree model using Weka This tutorial is perfect for newcomers to machine learning and decision trees, and those. The post Build a Decision Tree in Minutes using Weka (No Coding Required!) appeared first on Analytics Vidhya.

article thumbnail

When Will AutoML replace Data Scientists? Poll Results and Analysis

KDnuggets

Will AI always be 5-10 years away? The majority of respondents to this poll think that AutoML will reach expert level in 5-10 years. Interestingly, it is about the same as 5 years ago. We examine the trends by AutoML experience, industry, and region.

article thumbnail

How to Stop Fetishizing AI

Dataconomy

Our misguided perceptions of AI confuse the vital public debate about AI’s role in society by mitigating its severity and exaggerating its impact. Artificial Intelligence is sexy. It’s been able to translate between languages, recommend us new TV shows to watch, and beat humans at everything from Go to Jeopardy. . The post How to Stop Fetishizing AI appeared first on Dataconomy.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Identifying Leakage in Computer Vision on Medical Images

DataRobot

Computer vision has been suggested to help with the battle against COVID-19. In this article, we want to share our preliminary research into two different datasets. Confirming existing results , we found defects with the existing approaches, due to leakage when building COVID image datasets from heterogeneous sources. We explain here why these issues occurred, using interpretability tools to detect them.

AI 93
article thumbnail

Google Video – Rules of Machine Learning

Data Science 101

To be great with machine learning, it helps to be a great engineer. That means doing the following: write simple code make it readable comment it fix the ever present sign mistake leverage peer review and version control track performance launch and iterate. Those are the general rules for software engineering, this video contains some specific rules for software with machine learning.

article thumbnail

How Big Data Has Revolutionized the Gaming Industry

Smart Data Collective

Big data is driving a number of changes in our lives. Forbes recently wrote an article about the impact of big data on the food and hospitality industry. However, other sectors are changing as well. Big data phenomenon has revolutionized almost every aspect of an average citizen’s life. Information about our online activity has been accumulating for years, and now is actively used to know more about us.

Big Data 103
article thumbnail

Getting into Deep Learning? Here are 5 Things you Should Absolutely Know

Analytics Vidhya

Starting your Deep Learning Career? Deep learning can be a complex and daunting field for newcomers. Concepts like hidden layers, convolutional neural networks, backpropagation. The post Getting into Deep Learning? Here are 5 Things you Should Absolutely Know appeared first on Analytics Vidhya.

article thumbnail

How Embedded Analytics Gets You to Market Faster with a SAAS Offering

Start-ups & SMBs launching products quickly must bundle dashboards, reports, & self-service analytics into apps. Customers expect rapid value from your product (time-to-value), data security, and access to advanced capabilities. Traditional Business Intelligence (BI) tools can provide valuable data analysis capabilities, but they have a barrier to entry that can stop small and midsize businesses from capitalizing on them.