June, 2020

article thumbnail

4 Simple Ways to Split a Decision Tree in Machine Learning

Analytics Vidhya

Overview How do you split a decision tree? What are the different splitting criteria when working with decision trees? Learn all about decision tree. The post 4 Simple Ways to Split a Decision Tree in Machine Learning appeared first on Analytics Vidhya.

article thumbnail

Applying AI Solutions at the Startup, Growth and Enterprise Stages

Dataconomy

We see companies applying AI solutions differently, depending on their growth stage. Here are the challenges they face and the best practices at each stage. A growing number of companies are seeking to apply artificial intelligence (AI) solutions, whether they want to launch disruptive products or innovate the customer experience. The post Applying AI Solutions at the Startup, Growth and Enterprise Stages appeared first on Dataconomy.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Eight Questions on Data Governance: An Interview with Alation Co-Founder and Chief Data Officer Aaron Kalb

Alation

The post Eight Questions on Data Governance: An Interview with Alation Co-Founder and Chief Data Officer Aaron Kalb appeared first on Alation.

article thumbnail

What I Love about Scrum for Data Science

Eugene Yan

Initially, I didn't like it. But over time, it grew on me. Here's why.

article thumbnail

Navigating the Future: Generative AI, Application Analytics, and Data

Generative AI is upending the way product developers & end-users alike are interacting with data. Despite the potential of AI, many are left with questions about the future of product development: How will AI impact my business and contribute to its success? What can product managers and developers expect in the future with the widespread adoption of AI?

article thumbnail

Four truths of resilient tech stacks

Twilio Segment

David Raab, the founder of the CDP Institute, shared a 4-step process for how organizations achieve resilience. Learn the truths and hear examples from Segment STACKED experts on how their companies live and breathe each step of this resilience process.

52
article thumbnail

Edge Computing: IT meets Industry

DataCentric podcast

Enterprise IT is rapidly moving towards the edge, but what does that really mean? There are extremes to what defines edge. Edge can describe the safe environment of a remote branch office, or maybe the air-conditioned cushiness of a remote telco hut. Then there's the extreme edge, where the lines between IT and OT blur, and anything can go. Jason Andersen, Stratus Technologies' vice president of business line management, joins Moor Insights & Strategy senior technology analysts Matt

52

More Trending

article thumbnail

How heads of data can jumpstart machine learning without hiring

Dataconomy

Read on the extract of the guide “6 steps to jumpstart machine learning using the resources you already have” written by Explorium – it explains how senior data professionals can enable data science in their organizations with the resources they have available, how to use cutting-edge technology to make the. The post How heads of data can jumpstart machine learning without hiring appeared first on Dataconomy.

article thumbnail

Prodigy v1.10: Dependencies, relations, audio, video & more

Explosion

Version 1.10 of Prodigy includes tons of new features, including manual dependency and relation annotation, audio and video annotation, a new and improved image UI, new recipe callbacks, more settings for manual NER, plus various new config options and settings.

52
article thumbnail

How to Set Up a Python Project For Automation and Collaboration

Eugene Yan

After this article, we'll have a workflow of tests and checks that run automatically with each git push.

Python 100
article thumbnail

Data validation for NLP applications with topic models

Depends on the Definition

In a recent article, we saw how to implement a basic validation pipeline for text data. Once a machine learning model has been deployed its behavior must be monitored. The predictive performance is expected to degrade over time as the environment changes.

article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

Securing the Infrastructure Supply Chain

DataCentric podcast

You don't always think about the security of your vendor's supply chain as a part of your organization's cybersecurity and compliance efforts -- but you should. Everything from ensuring the authenticity of components, all the way through the potential exploitation of systems & subsystems in-transit can cause enterprise's damage. This episode has hosts Steve McDowell and Matt Kimball, both senior analysts at Moor Insights & Strategy, talking with John Grosso, Hewlett Packa

52
article thumbnail

10 Compelling Reasons you Should Use JupyterLab for Data Science Coding

Analytics Vidhya

Overview JupyterLab is a brilliant coding environment to perform data science tasks These 10 reasons will convince to switch to JupyterLab from Jupyter Notebooks. The post 10 Compelling Reasons you Should Use JupyterLab for Data Science Coding appeared first on Analytics Vidhya.

article thumbnail

Want to learn Data Science? Start with this course

Dataconomy

Choosing a Bootcamp in person or online is a tough decision, most people are looking for a career change that could boost their employment opportunities and salary. At Dataconomy we spoke to Ariadna Cuffi, a student from the Allwomen data science course for a detailed review. Allwomen have courses in. The post Want to learn Data Science? Start with this course appeared first on Dataconomy.

article thumbnail

Web Security 101: Cross-Site Scripting (XSS) Attacks

Victor Zhou

Cross-Site Scripting (XSS) vulnerabilities are one of the most dangerous web security holes that exist. In this post, we’ll see an interactive demo of XSS and learn how to protect against it. This is the second post in my Web Security 101 series. If you’ve read my introduction to CSRF, some of the preamble below might look familiar… feel free to skip ahead a bit.

52
article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

What I Do Before a Data Science Project to Ensure Success

Eugene Yan

Haste makes waste. Diving into a data science problem may not be the fastest route to getting it done.

article thumbnail

Searching for Answers

Alation

The post Searching for Answers appeared first on Alation.

86
article thumbnail

How to identify and prioritize high-value support tickets

Twilio Segment

With this recipe, you’ll learn how to identify support tickets by pricing plan so you can prioritize your response times accordingly.

52
article thumbnail

8 SQL Techniques to Perform Data Analysis for Analytics and Data Science

Analytics Vidhya

Overview SQL is a must-know language for anyone in analytics or data science Here are 8 nifty SQL techniques for data analysis that ever. The post 8 SQL Techniques to Perform Data Analysis for Analytics and Data Science appeared first on Analytics Vidhya.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

I Reverse Engineered Monzo’s Content Marketing Strategy: Here’s What Marketers Can Learn

Dataconomy

App-only challenger banks have risen to try and push incumbents off their financial perch, and in this battle, content marketing has been a valuable tool in their repertoires. Have you ever wondered how they do it? If you’ve never heard of Monzo, it’s a UK-based challenger bank that has grown. The post I Reverse Engineered Monzo’s Content Marketing Strategy: Here’s What Marketers Can Learn appeared first on Dataconomy.

212
212
article thumbnail

RL Unplugged: Benchmarks for Offline Reinforcement Learning

DeepMind

We propose a benchmark called RL Unplugged to evaluate and compare offline RL methods. RL Unplugged includes data from a diverse range of domains including games (e.g., Atari benchmark) and simulated motor control problems (e.g. DM Control Suite). The datasets include domains that are partially or fully observable, use continuous or discrete actions, and have stochastic vs. deterministic dynamics.

44
article thumbnail

My Notes From Spark+AI Summit 2020 (Application-Agnostic Talks)

Eugene Yan

Sharing my notes & practical knowledge from the conference for people who don't have the time.

AI 100
article thumbnail

Four in a Row: Customers Give Alation the Top-Rank in Dresner’s 2020 Wisdom of Crowds® Data Catalog Market Study

Alation

The post Four in a Row: Customers Give Alation the Top-Rank in Dresner’s 2020 Wisdom of Crowds® Data Catalog Market Study appeared first on Alation.

article thumbnail

Manufacturing Sustainability Surge: Your Guide to Data-Driven Energy Optimization & Decarbonization

Speaker: Kevin Kai Wong, President of Emergent Energy Solutions

In today's industrial landscape, the pursuit of sustainable energy optimization and decarbonization has become paramount. Manufacturing corporations across the U.S. are facing the urgent need to align with decarbonization goals while enhancing efficiency and productivity. Unfortunately, the lack of comprehensive energy data poses a significant challenge for manufacturing managers striving to meet their targets.

article thumbnail

The A1 Telekom Austria Hack

Christian Haschek

On the 3rd of February 2020 I received an encrypted email on 3 of my email addresses from a person

52
article thumbnail

Hands-on NLP Project: A Comprehensive Guide to Information Extraction using Python

Analytics Vidhya

Overview Information extraction is a powerful NLP concept that will enable you to parse through any piece of text Learn how to perform information. The post Hands-on NLP Project: A Comprehensive Guide to Information Extraction using Python appeared first on Analytics Vidhya.

Python 398
article thumbnail

Employee Spend During the Pandemic: What Can Organizations Learn?

Dataconomy

It’s essential for organizations to be aware of shifts in employee spend that invite different risks. The data scientists at Oversight, an enterprise spend management software, reveal the ongoing spend shifts organizations face and how the pandemic has quickly reshaped travel and expense employee spend. A high-level look at the. The post Employee Spend During the Pandemic: What Can Organizations Learn?

article thumbnail

Applying for technical roles

DeepMind

It’s no secret that the gender gap still exists within STEM. Despite a slight increase in recent years, studies show that women only make up about a quarter of the overall STEM workforce in the UK. While the reasons vary, many women report feeling held back by a lack of representation, clear opportunities and information on what working in the sector actually involves.

44
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Why Are My Airflow Jobs Running “One Day Late”?

Eugene Yan

A curious discussion made me realize my expert blind spot. And no, Airflow is not late.

100
100
article thumbnail

Deliver on the Promise of the Cloud with Alation and Databricks

Alation

The post Deliver on the Promise of the Cloud with Alation and Databricks appeared first on Alation.

article thumbnail

Introducing spaCy v2.3

Explosion

spaCy now speaks Chinese, Japanese, Danish, Polish and Romanian! Version 2.3 of the spaCy Natural Language Processing library adds models for five new languages. We've also updated all 15 model families with word vectors and improved accuracy, while also decreasing model size and loading times for models with vectors.

article thumbnail

3 Building Blocks of Machine Learning you Should Know as a Data Scientist

Analytics Vidhya

Overview A machine learning system consists of multiple building blocks that need to be managed Learn about the three key building blocks of machine. The post 3 Building Blocks of Machine Learning you Should Know as a Data Scientist appeared first on Analytics Vidhya.

article thumbnail

How Embedded Analytics Gets You to Market Faster with a SAAS Offering

Start-ups & SMBs launching products quickly must bundle dashboards, reports, & self-service analytics into apps. Customers expect rapid value from your product (time-to-value), data security, and access to advanced capabilities. Traditional Business Intelligence (BI) tools can provide valuable data analysis capabilities, but they have a barrier to entry that can stop small and midsize businesses from capitalizing on them.