October, 2019

article thumbnail

Feature Selection: Beyond feature importance?

KDnuggets

In this post, you will see 3 different techniques of how to do Feature Selection to your datasets and how to build an effective predictive model.

article thumbnail

Add Shine to your Data Science Resume with these 8 Ambitious Projects on GitHub

Analytics Vidhya

Overview Here are eight ambitious data science projects to add to your data science portfolio We have divided these projects into three categories – The post Add Shine to your Data Science Resume with these 8 Ambitious Projects on GitHub appeared first on Analytics Vidhya.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Consumer data might be the new oil, but who gets to decide how it’s used?

Dataconomy

From the Cambridge Analytica scandal to GDPR and data breach headlines, the idea that consumers should know how their data is used is gaining traction with governments and consumer groups. What does this trend mean for companies that rely on consumer data for their business model? Right now, consumer data. The post Consumer data might be the new oil, but who gets to decide how it’s used?

Big Data 190
article thumbnail

DataScience SG x ODSC Meetup - Applying ML to Healthcare

Eugene Yan

In-depth sharing on how to put machine learning systems into production.

ML 130
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

How Data-Driven Marketing Helps Improve Conversion Rates

Smart Data Collective

According to a recent Adobe report , marketers have identified data-driven marketing as the most important business opportunity for 2019. That clearly indicates the importance that marketers give to data and why you should too. After all, data-driven marketing can help you achieve numerous benefits for your business, ultimately leading to an increase in conversions.

Analytics 110
article thumbnail

Microsoft Research Open Data Search

Data Science 101

Microsoft Research Open Data is a search engine for free datasets available from Microsoft Research. The datasets are primarily aimed at Natural Language Processing (NLP) and computer vision. Take a look if you are in need of a dataset for your next project.

More Trending

article thumbnail

Mathematics behind Machine Learning – The Core Concepts you Need to Know

Analytics Vidhya

Overview Here’s an intuitive and beginner friendly guide to the mathematics behind machine learning Learn the various math concepts required for machine learning, including. The post Mathematics behind Machine Learning – The Core Concepts you Need to Know appeared first on Analytics Vidhya.

article thumbnail

MLOps can help overcome risk in AI and ML projects

Dataconomy

Aleksandar Kova?evi?, Sales Engineer at InterSystems, shares how companies use MLOps combined with a central multi-model database to get the most out of their machine learning initiatives. Artificial Intelligence (AI) and Machine Learning (ML) are hot topics at the moment. But when it comes to producing quantifiable results, there is. The post MLOps can help overcome risk in AI and ML projects appeared first on Dataconomy.

ML 183
article thumbnail

OLX Prod Tech 2019 Keynote - Asia's Tech Giants

Eugene Yan

Keynote on how Asia's tech giants scale and their SuperApp strategy.

130
130
article thumbnail

Artificial Intelligence and Big Data in Higher Education: Promising or Perilous?

Smart Data Collective

What exactly is artificial intelligence (AI) and what business does it have in higher education? Simply put, AI is an attempt to emulate human knowledge by programming extensive rules into computers. Through machine learning and expert systems, machines can produce patterns within mass flows of data and pinpoint correlations that couldn’t possibly be immediately intuitive to humans.

article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

Why You should Attend SQLSaturday – An Interview with John Byrnes

Data Science 101

A few weeks ago, I attended my first SQLSaturday event. I brought along my camera and was lucky enough to record a couple interviews. This is one of those interviews. I sat down with John Byrnes and we discussed: Where has SQL taken his career? What does the SQLSaturday community mean to him? Why should someone attend a SQLSaturday event? SQLSaturday Interview with John Byrnes.

SQL 92
article thumbnail

Everything a Data Scientist Should Know About Data Management

KDnuggets

For full-stack data science mastery, you must understand data management along with all the bells and whistles of machine learning. This high-level overview is a road map for the history and current state of the expansive options for data storage and infrastructure solutions.

article thumbnail

Knowledge Graph – A Powerful Data Science Technique to Mine Information from Text (with Python code)

Analytics Vidhya

Overview Knowledge graphs are one of the most fascinating concepts in data science Learn how to build a knowledge graph using text from Wikipedia. The post Knowledge Graph – A Powerful Data Science Technique to Mine Information from Text (with Python code) appeared first on Analytics Vidhya.

article thumbnail

Not All Pregnancies Count: The Data Gap on Maternal Health and Jails

Dataconomy

Data Science is a remarkable field that enables researchers to improve the quality of life for humanity. However, officials have failed to use the technology to benefit one of the nation’s most at-risk groups – incarcerated women. Researchers are taking a closer look at typically overlooked jailhouse pregnancies in a. The post Not All Pregnancies Count: The Data Gap on Maternal Health and Jails appeared first on Dataconomy.

article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

Grand Slam Analytics: Who Will Win the World Series?

DataRobot

This blog is meant to be a fun and unique take on predicting the 2019 MLB World Series winner.

article thumbnail

Big Data In Hockey Takes The Sport By Storm

Smart Data Collective

While sports analytics is hardly a new idea, how it is being used in hockey has changed dramatically. Coaches no longer have to wait for newspapers to print out statistics. Fans are not left to wonder how their favorite players are truly excelling. What has brought about this change? Big data. Big data has taken all forms of sports analytics by storm, but we are only now really beginning to see what it can do for hockey.

Big Data 108
article thumbnail

Fundamentals of Data Mining

Data Science 101

Today we are generating data more than ever before. Over the last two years, 90 percent of the data in the world was generated. This data alone does not make any sense unless it’s identified to be related in some pattern. Data mining is the process of discovering these patterns among the data and is therefore also known as Knowledge Discovery from Data (KDD).

article thumbnail

Introduction to Natural Language Processing (NLP)

KDnuggets

Have you ever wondered how your personal assistant (e.g: Siri) is built? Do you want to build your own? Perfect! Let’s talk about Natural Language Processing.

article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

article thumbnail

Hands-On Introduction to Web Scraping in Python: A Powerful Way to Extract Data for your Data Science Project

Analytics Vidhya

Overview Web scraping is a highly effective method to extract data from websites (depending on the website’s regulations) Learn how to perform web scraping. The post Hands-On Introduction to Web Scraping in Python: A Powerful Way to Extract Data for your Data Science Project appeared first on Analytics Vidhya.

article thumbnail

A Data Scientist’s relationship with building Predictive Models

Dataconomy

If you’re a Data Scientist, you’ve likely spent months earnestly developing and then deploying a single predictive model. The truth is that once your model is built – that’s only half the battle won. A quarter of a Data Scientist’s working life often goes something like this: You met with. The post A Data Scientist’s relationship with building Predictive Models appeared first on Dataconomy.

article thumbnail

You Can’t Sell Shampoo to a Bald Guy: AI Best Practices for Marketing

DataRobot

Recently, my Facebook feed has become clogged with advertisements. Many of those advertisements are selling products that aren’t a good match for me, including ones for shampoo and hair care products!

AI 15
article thumbnail

Leveraging Big Data With State-Of-The-Art Business Dashboards

Smart Data Collective

There are a lot of ways that organizations can leverage big data. Most of them don’t have difficulty collecting the data they need to make more informed decisions. However, they often struggle to conceptualize the data and present it in a format that supports their conclusions. This is one of the areas where a business dashboard can be useful. Big data has made business dashboards possible.

Big Data 103
article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?

article thumbnail

Cut out everything that’s not surprising

Hacker News

This is my advice to anyone writing something for the public — especially a talk on stage. People listen to a talk, or read an article, because they want to learn something new. They want a little “oh wow” moment. “I never thought of it that way before.”. People only really learn when they’re surprised. If they’re not surprised, then what you told them just fits in with what they already know.

65
article thumbnail

10 Free Top Notch Natural Language Processing Courses

KDnuggets

Are you looking to learn natural language processing? This collection of 10 free top notch courses will allow you to do just that, with something for every approach to learning NLP and its varied topics.

article thumbnail

A Comprehensive Guide to Learn Swift from Scratch for Data Science

Analytics Vidhya

Overview Swift is quickly becoming one of the most powerful and effective languages for data science Swift is quite similar to Python so you’ll. The post A Comprehensive Guide to Learn Swift from Scratch for Data Science appeared first on Analytics Vidhya.

article thumbnail

The Medflix Principle : How to use curated content for healthcare

Dataconomy

Netflix excels at attracting users to its platform and hooking them on to the content it provides. For several years, pharmaceutical companies have wanted to achieve the same thing with specialised medical content. But is it that easy to transfer the “Netflix principle” to the pharmaceutical industry? What kind of. The post The Medflix Principle : How to use curated content for healthcare appeared first on Dataconomy.

142
142
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Data Science Fails: There's No Such Thing As A Free Lunch

DataRobot

When I was young, I took a packed lunch to school every day, and since I grew up in Australia, my packed lunch would include a couple of Vegemite sandwiches. Unless you grew up in Australia, you’ve probably never tasted it. And judging by this American’s first taste reaction of “Oh, that’s bad!”, you probably wouldn’t like the taste if you tried it out.

article thumbnail

Data-Driven Digital Marketing Carves Competitive Edge For SMEs

Smart Data Collective

Big data is playing a vital role in the evolution of small business. A compilation of research from the G2 Learning Hub Shows the number of businesses relying on big data is rising. They cited one study showing that 40% of businesses need to use unstructured data on a nearly daily basis. One of the ways that businesses can gain an edge is with digital marketing strategies that hinge on big data.

Big Data 102
article thumbnail

Introducing spaCy v2.2

Explosion

Version 2.2 of the spaCy Natural Language Processing library is leaner, cleaner and even more user-friendly. In addition to new model packages and features for training, evaluation and serialization, we’ve made lots of bug fixes, improved debugging and error handling, and greatly reduced the size of the library on disk. While we’re grateful to the whole spaCy community for their patches and support, Explosion has been lucky to welcome two new team members who deserve special credit for the recen

article thumbnail

The 5 Classification Evaluation Metrics Every Data Scientist Must Know

KDnuggets

This post is about various evaluation metrics and how and when to use them.

article thumbnail

Apache Airflow® Best Practices: DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!