Sat.Oct 05, 2019 - Fri.Oct 11, 2019

article thumbnail

10 Free Top Notch Natural Language Processing Courses

KDnuggets

Are you looking to learn natural language processing? This collection of 10 free top notch courses will allow you to do just that, with something for every approach to learning NLP and its varied topics.

article thumbnail

Hands-On Introduction to Web Scraping in Python: A Powerful Way to Extract Data for your Data Science Project

Analytics Vidhya

Overview Web scraping is a highly effective method to extract data from websites (depending on the website’s regulations) Learn how to perform web scraping. The post Hands-On Introduction to Web Scraping in Python: A Powerful Way to Extract Data for your Data Science Project appeared first on Analytics Vidhya.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Consumer data might be the new oil, but who gets to decide how it’s used?

Dataconomy

From the Cambridge Analytica scandal to GDPR and data breach headlines, the idea that consumers should know how their data is used is gaining traction with governments and consumer groups. What does this trend mean for companies that rely on consumer data for their business model? Right now, consumer data. The post Consumer data might be the new oil, but who gets to decide how it’s used?

Big Data 190
article thumbnail

DataScience SG x ODSC Meetup - Applying ML to Healthcare

Eugene Yan

In-depth sharing on how to put machine learning systems into production.

ML 130
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

The 4 Quadrants of Data Science Skills and 7 Principles of Marie Kondo approach to Data Visualization

KDnuggets

As a data scientist, your most important skill is creating meaningful visualizations to disseminate knowledge and impact your organization or client. These seven principals will guide you toward developing charts with clarity, as exemplified with data from a recent KDnuggets poll.

article thumbnail

Deployed your Machine Learning Model? Here’s What you Need to Know About Post-Production Monitoring

Analytics Vidhya

Overview What are the next steps after you’ve deployed your machine learning model? Post-deployment monitoring is a crucial step in any machine learning project. The post Deployed your Machine Learning Model? Here’s What you Need to Know About Post-Production Monitoring appeared first on Analytics Vidhya.

More Trending

article thumbnail

Microsoft Research Open Data Search

Data Science 101

Microsoft Research Open Data is a search engine for free datasets available from Microsoft Research. The datasets are primarily aimed at Natural Language Processing (NLP) and computer vision. Take a look if you are in need of a dataset for your next project.

article thumbnail

Activation maps for deep learning models in a few lines of code

KDnuggets

We illustrate how to show the activation maps of various layers in a deep CNN model with just a couple of lines of code.

article thumbnail

A Detailed Guide to the Powerful SIFT Technique for Image Matching (with Python code)

Analytics Vidhya

Overview A beginner-friendly introduction to the powerful SIFT (Scale Invariant Feature Transform) technique Learn how to perform Feature Matching using SIFT We also showcase. The post A Detailed Guide to the Powerful SIFT Technique for Image Matching (with Python code) appeared first on Analytics Vidhya.

Python 282
article thumbnail

Leveraging Big Data With State-Of-The-Art Business Dashboards

Smart Data Collective

There are a lot of ways that organizations can leverage big data. Most of them don’t have difficulty collecting the data they need to make more informed decisions. However, they often struggle to conceptualize the data and present it in a format that supports their conclusions. This is one of the areas where a business dashboard can be useful. Big data has made business dashboards possible.

Big Data 103
article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

You Can’t Sell Shampoo to a Bald Guy: AI Best Practices for Marketing

DataRobot

Recently, my Facebook feed has become clogged with advertisements. Many of those advertisements are selling products that aren’t a good match for me, including ones for shampoo and hair care products!

AI 15
article thumbnail

Introduction to Artificial Neural Networks

KDnuggets

In this article, we’ll try to cover everything related to Artificial Neural Networks or ANN.

301
301
article thumbnail

7 Amazing NLP Hack Sessions to Watch out for at DataHack Summit 2019

Analytics Vidhya

Picture a world where: Machines are able to have human-level conversations with us Computers understand the context of the conversation without having to be. The post 7 Amazing NLP Hack Sessions to Watch out for at DataHack Summit 2019 appeared first on Analytics Vidhya.

Analytics 234
article thumbnail

Integrating Sinatra Into Ruby To Expedite Application Development

Smart Data Collective

Sinatra is a great web application library. It can be used to streamline development in Ruby and several other programming languages. A number of articles on Dzone have been written on using Sinatra to develop web applications and other solutions, but there didn’t appear to be an article on setting up Sinatra. We decided to provide a quick tutorial on this process.

87
article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

Going AI: Why Retailers Need More (ML) Models than London’s Fashion Week

DataRobot

The best models on the planet were in London recently. At the London Fashion week, we saw the top supermodels from around the world come together to showcase the newest styles and fashion. At the same time, DataRobot was in town at the Retail Tech Event parading their automated machine learning models to the retail industry. Both types of models make a big impact on the retail industry, but it is DataRobot’s models that have the power to reshape the future of retail as we know it.

ML 14
article thumbnail

Data Science is Boring (Part 2)

KDnuggets

Why I love boring ML problems and how I think about them.

article thumbnail

Using spaCy with Hugging Face Transformers

Explosion

Transformer models like BERT have set a new standard for accuracy on almost every NLP leaderboard. However, these models are very new, and most of the software ecosystem surrounding them is oriented towards the many opportunities for further research. In this talk, Matt describes how you can now use these models in spaCy to work on real problems and the many opportunities transfer learningfor production NLP, regardless of which software packages you choose.

52
article thumbnail

Talking Data Protection with Arcserve CTO Oussama El-Hilali & VP of Strategic Partnerships Clark Brown

DataCentric podcast

In the data era, data protection is everything. Cyber resilience can mean life and death for companies that rely on intelligence to drive business. So what’s more important? Securing your organization? Or being able to quickly recover from cyberattacks without having to pay ridiculous ransom amounts to a group of hackers in Eastern Europe? The answer is “both.

40
article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

article thumbnail

Intelligent Automation for the Public Sector

DataRobot

What is Intelligent Automation? Intelligent Automation (IA) refers to the application of artificial intelligence (AI) and related technologies in particular combining machine learning , and robotic process automation (RPA). This convergence of technologies produces automation capabilities that dramatically elevate business value and competitive advantages for organizations.

article thumbnail

Math for Programmers.

KDnuggets

Math for Programmers teaches you the math you need to know for a career in programming, concentrating on what you need to know as a developer.

264
264
article thumbnail

Explosion awarded META Seal of Recognition

Explosion

We’re proud to accept the META Seal of Recognition at META-FORUM in Brussels, along with Mozilla. The META-FORUM is an international conference series backed by the European Union on powerful and innovative Language Technologies for a multilingual information society.

40
article thumbnail

Contributing to PyTorch: By someone who doesn’t know a ton about PyTorch

KDnuggets

By the end of my week with the team, I managed to proudly cut two PRs on GitHub. I decided that I would write a blog post to knowledge share, not just to show that YES, you can too.

Python 256
article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?

article thumbnail

8 Paths to Getting a Machine Learning Job Interview

KDnuggets

While you may be focused on your performance during your next job interview, landing that interview can be just as hard. Check out these tips for finding and securing an interview for a machine learning job.

article thumbnail

The problem with metrics is a big problem for AI

KDnuggets

The practice of optimizing metrics is not new nor unique to AI, yet AI can be particularly efficient (even too efficient!) at doing so.

AI 250
article thumbnail

There is No Such Thing as a Free Lunch: Part 1

KDnuggets

You have heard the expression “there is no such thing as a free lunch” – well in machine learning the same principle holds. In fact there is even a theorem with the same name.

article thumbnail

Four questions to help accurately scope analytics engineering project

KDnuggets

Being really good at scoping analytics projects is crucial for team productivity and profitability. You can consistently deliver on time if you work out the issue first, and these four questions can help you prepare.

Analytics 242
article thumbnail

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m

article thumbnail

Beyond Word Embedding: Key Ideas in Document Embedding

KDnuggets

This literature review on document embedding techniques thoroughly covers the many ways practitioners develop rich vector representations of text -- from single sentences to entire books.

237
237
article thumbnail

Lemma, Lemma, Red Pyjama: Or, doing words with AI

KDnuggets

If we want a machine learning model to be able to generalize these forms together, we need to map them to a shared representation. But when are two different words the same for our purposes? It depends.

article thumbnail

Why the ‘why way’ is the right way to restoring trust in AI

KDnuggets

As so many more organizations now rely on AI to deliver services and consumer experiences, establishing a public trust in the AI is crucial as these systems begin to make harder decisions that impact customers.

AI 226
article thumbnail

Math in Our Lives video collection from SIAM

KDnuggets

Having trouble explaining why applied math matters to your non-specialist friends and colleagues? As valued members of the applied math community and ambassadors of SIAM, review these short animations and share them with your interested networks! Help us show that math matters and why.

218
218
article thumbnail

Apache Airflow® Best Practices: DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!