Sat.Mar 21, 2020 - Fri.Mar 27, 2020

article thumbnail

10 Awesome Data Manipulation and Wrangling Hacks, Tips and Tricks

Analytics Vidhya

Introduction “Efficiency is doing things right. Effectiveness is doing the right thing.” – Zig Zagler As data scientists, we are often taught to be. The post 10 Awesome Data Manipulation and Wrangling Hacks, Tips and Tricks appeared first on Analytics Vidhya.

article thumbnail

Coronavirus Data and Poll Analysis – yes, there is hope, if we act now

KDnuggets

We examine the growth of coronavirus daily cases in most affected countries, and show evidence that social distancing works in reducing the rate of spread. We also analyze KDnuggets Poll results - the scale of change to online and how Data Science work is likely to increase or drop in different regions. Stay Healthy and practice social distancing!

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

HackCorona: 300 participants, 41 nationalities, 23 solutions to fight COVID-19 outbreak

Dataconomy

In just one day, the HackCorona initiative gathered over 1700 people and 300 selected hackers came up with 23 digital solutions to help the world fight the COVID-19 outbreak during the 48-hour long virtual hackathon by Data Natives and Hacking Health. Here are the results. HackCorona was created on March 17th. The post HackCorona: 300 participants, 41 nationalities, 23 solutions to fight COVID-19 outbreak appeared first on Dataconomy.

Big Data 231
article thumbnail

5 Ingenious Ways To Use Big Data For Customer Engagement

Smart Data Collective

Big data is changing the direction of our economy in unprecedented ways. Every business should look for ways to monetize big data and use it to optimize your business model. The number of companies using big data is growing at an accelerated rate. One poll found that 53% of businesses were using big data analytics in 2017. This figure has presumably risen in the years since.

Big Data 142
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

Using Graphs to Identify Social Media Influencers

Analytics Vidhya

Overview Learn how to use graphs to identify social media influencers We will demonstrate several techniques to identify these social media influencers and lay. The post Using Graphs to Identify Social Media Influencers appeared first on Analytics Vidhya.

Analytics 400
article thumbnail

Exploring TensorFlow Quantum, Google’s New Framework for Creating Quantum Machine Learning Models

KDnuggets

TensorFlow Quantum allow data scientists to build machine learning models that work on quantum architectures.

More Trending

article thumbnail

Hilary Mason – The Future of AI and Machine Learning

Data Science 101

Hilary Mason is the Founder of Fast Forward Labs. She has been involved in the data science space for over a decade. She is a real thought leader in the data space. This keynote was delivered at ODSC East 2020. The post Hilary Mason – The Future of AI and Machine Learning appeared first on Data Science 101.

article thumbnail

Free GPUs for Everyone! Get Started with Google Colab for Machine Learning and Deep Learning

Analytics Vidhya

Google Colab – Now Build Large Deep Learning Models on your Machine! “Memory Error” – that all too familiar dreaded message in Jupyter notebooks. The post Free GPUs for Everyone! Get Started with Google Colab for Machine Learning and Deep Learning appeared first on Analytics Vidhya.

article thumbnail

Why BERT Fails in Commercial Environments

KDnuggets

The deployment of large transformer-based models in dynamic commercial environments often yields poor results. This is because commercial environments are usually dynamic, and contain continuous domain shifts between inference and training data.

369
369
article thumbnail

How ensembles can reduce machine learning’s carbon footprint

Dataconomy

Commercial and industrial applications of artificial intelligence and machine learning are unlocking economic opportunities, transforming the way we do business, and even helping to solve complex social and environmental problems. In fact, generative applications of this technology have become tools for environmental sustainability. With machine learning’s capability to analyze and.

article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

How Insurance Companies Use Data To Measure Risk And Choose Rates

Smart Data Collective

The auto insurance industry has always relied on data analysis to inform their policies and determine individual rates. With the technology available today, there’s even more data to draw from. The good news is that this new data can help lower your insurance rate. Here is the type of data insurance companies use to measure a client’s potential risk and determine rates.

article thumbnail

Support Vector Regression Tutorial for Machine Learning

Analytics Vidhya

Unlocking a New World with the Support Vector Regression Algorithm Support Vector Machines (SVM) are popularly and widely used for classification problems in machine. The post Support Vector Regression Tutorial for Machine Learning appeared first on Analytics Vidhya.

article thumbnail

Want to Build an AI Model for Your Business? Read this

KDnuggets

The best approach for AI production is similar to what venture capitalists (VC’s) do when they evaluate and invest in startups.

AI 303
article thumbnail

Writing is Learning: How I Learned an Easier Way to Write

Eugene Yan

Writing begins before actually writing; it's a cycle of reading -> note-taking -> writing.

130
130
article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

How Big Data Has Revolutionized the Gaming Industry

Smart Data Collective

Big data is driving a number of changes in our lives. Forbes recently wrote an article about the impact of big data on the food and hospitality industry. However, other sectors are changing as well. Big data phenomenon has revolutionized almost every aspect of an average citizen’s life. Information about our online activity has been accumulating for years, and now is actively used to know more about us.

Big Data 131
article thumbnail

Coronavirus Analysis: Will Social Distancing Help Prevent the Spread?

Analytics Vidhya

Introduction We are in the midst of a global crisis. The coronavirus, or COVID-19, has officially been declared a pandemic and it is wreaking. The post Coronavirus Analysis: Will Social Distancing Help Prevent the Spread? appeared first on Analytics Vidhya.

Analytics 343
article thumbnail

Made With ML: Discover, build, and showcase machine learning projects

KDnuggets

This is a short introduction to Made With ML, a useful resource for machine learning engineers looking to get ideas for projects to build, and for those looking to share innovative portfolio projects once built.

article thumbnail

AI in Turbulent Times: Navigating Changing Conditions Webinar

DataRobot

Data science teams are scrambling to update their models in the wake of extreme and unforeseen worldwide changes brought on by the global COVID-19 pandemic. In the face of these unprecedented events, one of the key concerns among many data scientists is that their current models could be generating inaccurate or misleading predictions. In the webinar, AI in Turbulent Times: Navigating Changing Conditions , we outlined the steps that data scientists can take to incorporate robustness into their m

article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

article thumbnail

The Incredibly Important Role Of Big Data In Academia

Smart Data Collective

One of the most important elements in the evolution of the education system is the ability to make informed conclusions about the need to change approaches that are used and the actions that are taken. According to a 2015 whitepaper published in Science Direct , big data is one of the most disruptive technologies influencing the field of academia. The educational system continuously creates and accumulates a significant amount of data, and the question of the systematic work with these data by a

Big Data 121
article thumbnail

6 Python Libraries to Interpret Machine Learning Models and Build Trust

Analytics Vidhya

The Case for Building Trust in Machine Learning Models There are approximately 1.2 billion vehicles on the roads around the world. Here’s a bamboozling. The post 6 Python Libraries to Interpret Machine Learning Models and Build Trust appeared first on Analytics Vidhya.

article thumbnail

Evaluating Ray: Distributed Python for Massive Scalability

KDnuggets

If your team has started using ?Ray? and you’re wondering what it is, this post is for you. If you’re wondering if Ray should be part of your technical strategy for Python-based applications, especially ML and AI, this post is for you.

Python 279
article thumbnail

D.R.I.V.E. 2020 MLB Projections: DataRobot Intelligent Value Estimator

DataRobot

At DataRobot , we love problems that involve large sets of data, discrete cause-and-effect events, and difficult predictions; which makes baseball the ideal playground for our data scientists. On our normal days, we work closely with our customers to build prediction models that demystify the future with advanced machine learning techniques. With the current lack of baseball games to watch and enjoy due to delays to the 2020 season, we set our attention, skills, and technology towards exploring

article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?

article thumbnail

How Does AI Help Secure The Supply Chain?

Smart Data Collective

The supply chain is the source of every part of production. From raw materials to manufacturing to distribution, each step requires the most secure transition possible. Sometimes, though, the supply chain can come with various risks that affect these stages. Security issues arise from these vulnerabilities, and merchandise can be damaged or stolen, leading to more headaches and time-consuming procedures.

AI 118
article thumbnail

Data Science Papers for Spring 2020

Data Science 101

The world of data science is rapidly evolving. Here are a few data science papers I have found interesting. What’s Wrong with Computational Notebooks? Pain Points, Needs, and Design Opportunities This paper is a study done on the usage of notebooks for data science. It cover a bunch of the negative impacts of using notebooks for data science. Deployment, setup, collaboration, and reliablity are a few of the examples.

article thumbnail

Top AI Resources – Directory for Remote Learning

KDnuggets

Whether you are just learning Data Science, a current professional, or just interested, it's crucial to keep the mind stimulated and stay current. With conferences, schools, and travel largely canceled because of #coronavirus, these remote resources will help you stay engaged.

article thumbnail

Knowledge Graph 101: How To Easily Query the Web without Web Scraping

DataRobot Blog

by Jen Underwood. NOTE: The following solution review is from a January 2020 project contract commitment. Both the vendor and I discussed the sensitive timing of publication. They decided not to cancel. Read More.

Tableau 72
article thumbnail

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m

article thumbnail

The Growing Importance Of Machine Learning With Dedicated Servers

Smart Data Collective

Machine learning is changing the web hosting industry in countless ways. Many third-party hosting providers, such as Amazon Web Services have started utilizing machine learning in different capacities. Amazon Sagemaker is among the machine learning tools that have transformed the services that the platform offers to customers. Matthew Fryer, the Vice President and chief data science officer for Hotels.com has spoken very highly of this machine learning service.

article thumbnail

Cybersecurity in the age of the Intelligent Edge: A conversation with HPE and Aruba

DataCentric podcast

The Intelligent Edge, encompassing "devices that aren't in the datacenter", is growing at a nearly exponential rate. This challenges how traditional IT thinks about managing intelligent infrastructure, especially at the intersection of IT and OT, forcing everyone to think just a little bit differently -- all of this as nearly every company is forging their own paths.

52
article thumbnail

Diffusion Map for Manifold Learning, Theory and Implementation

KDnuggets

This article aims to introduce one of the manifold learning techniques called Diffusion Map. This technique enables us to understand the underlying geometric structure of high dimensional data as well as to reduce the dimensions, if required, by neatly capturing the non-linear relationships between the original dimensions.

article thumbnail

Data Science Papers for Spring 2020

Data Science 101

The world of data science is rapidly evolving. Here are a few data science papers I have found interesting. What’s Wrong with Computational Notebooks? Pain Points, Needs, and Design Opportunities This paper is a study done on the usage of notebooks for data science. It cover a bunch of the negative impacts of using notebooks for data science. Deployment, setup, collaboration, and reliablity are a few of the examples.

article thumbnail

Apache Airflow® Best Practices: DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!