May, 2017

article thumbnail

Big Data for Humans: The Importance of Data Visualization

Dataconomy

Everyone has heard the old moniker garbage in – garbage out. It is a simple way of saying that machine learning is only as good as the data, algorithms, and human experience that goes into them. But even the best results can be thought of as garbage if no one. The post Big Data for Humans: The Importance of Data Visualization appeared first on Dataconomy.

article thumbnail

OMSCS CS6476 (Computer Vision) Review and Tips

Eugene Yan

OMSCS CS6476 Computer Vision - Performing computer vision tasks with ONLY numpy.

130
130
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Reflections on running spaCy: commercial open-source NLP

Explosion

As more and more people and companies are getting involved with open-source software, balancing the expectations of an open community and a traditional provider vs. consumer relationship is becoming increasingly difficult. Are maintainers becoming too authoritarian? Are users becoming too demanding? Are large companies selling out open-source?

52
article thumbnail

Reflections on running spaCy: commercial open-source NLP

Ines Montani

As more and more people and companies are getting involved with open-source software, balancing the expectations of an open community and a traditional provider vs. consumer relationship is becoming increasingly difficult. Are maintainers becoming too authoritarian? Are users becoming too demanding? Are large companies selling out open-source? In this post I’ll share some lessons we’ve learned from running spaCy , the popular and fast-growing library for Natural Language Processing in Python.

article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

The Business Implications of Machine Learning

Dataconomy

As buzzwords become ubiquitous they become easier to tune out. We’ve finely honed this defense mechanism, for good purpose. It’s better to focus on what’s in front of us than the flavor of the week. CRISPR might change our lives, but knowing how it works doesn’t help you. VR could. The post The Business Implications of Machine Learning appeared first on Dataconomy.

article thumbnail

Blockchains could be every Data Scientist’s dream

Dataconomy

Bitcoin is currently trading at over $1250 and if you are someone who invested a grand in bitcoins back in 2011, your investments are potentially worth over $600K. The most valuable contribution of the bitcoin community is not in the financial returns itself, but in the introduction of blockchain technology. The post Blockchains could be every Data Scientist’s dream appeared first on Dataconomy.

More Trending

article thumbnail

Confused by data visualization? Here’s how to cope in a world of many features

Dataconomy

The late data visionary Hans Rosling mesmerised the world with his work, contributing to a more informed society. Rosling used global health data to paint a stunning picture of how our world is a better place now than it was in the past, bringing hope through data. Now more than. The post Confused by data visualization? Here’s how to cope in a world of many features appeared first on Dataconomy.

article thumbnail

Three Mistakes that Set Data Scientists up for Failure

Dataconomy

The rise of the data scientists continues and social media is filled with success stories – but what about those who fail? There are no cover articles praising the failures of the many data scientists that don’t live up to the hype and don’t meet the needs of their stakeholders. The post Three Mistakes that Set Data Scientists up for Failure appeared first on Dataconomy.

article thumbnail

Data Mining for Social Intelligence – Opinion data as a monetizable resource

Dataconomy

The digital age is characterised increasingly by the collective. The centralised database is being superseded by the blockchain; expert opinion yields ever more to the insights of the crowd. The information generated by tapping into the minds of many is driving decisions in both the public and private sector; market. The post Data Mining for Social Intelligence – Opinion data as a monetizable resource appeared first on Dataconomy.

article thumbnail

How to use ElasticSearch for Natural Language Processing and Text Mining — Part 2

Dataconomy

Welcome to Part 2 of How to use Elasticsearch for Natural Language Processing and Text Mining. It’s been some time since Part 1, so you might want to brush up on the basics before getting started. This time we’ll focus on one very important type of query for Text Mining. The post How to use ElasticSearch for Natural Language Processing and Text Mining — Part 2 appeared first on Dataconomy.

article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

Machine Learning using Spark and R

Dataconomy

R is ubiquitous in the machine learning community. Its ecosystem of more than 8,000 packages makes it the Swiss Army knife of modeling applications. Similarly, Apache Spark has rapidly become the big data platform of choice for data scientists. Its ability to perform calculations relatively quickly (due to features like in-memory. The post Machine Learning using Spark and R appeared first on Dataconomy.

article thumbnail

Data Nirvana – How to develop a data-driven culture

Dataconomy

As the creation and consumption of data continues to grow among businesses of all sizes, so does the challenge of analyzing and turning that data into actionable insights. According to IBM, 90 percent of the data in the world today has been created in the last two years, at 2.5. The post Data Nirvana – How to develop a data-driven culture appeared first on Dataconomy.

Big Data 183
article thumbnail

How to do time series prediction using RNNs, TensorFlow and Cloud ML Engine

Dataconomy

The Estimators API in tf.contrib.learn (See tutorial here) is a very convenient way to get started using TensorFlow. The really cool thing from my perspective about the Estimators API is that using it is a very easy way to create distributed TensorFlow models. Many of the TensorFlow samples that you. The post How to do time series prediction using RNNs, TensorFlow and Cloud ML Engine appeared first on Dataconomy.

ML 182
article thumbnail

Frequency Distribution Analysis using Python Data Stack – Part 1

Dataconomy

During my years as a Consultant Data Scientist I have received many requests from my clients to provide a frequency distribution reports for their specific business data needs. These reports have been very useful for the company management to make proper business decisions quickly. In this paper I would like. The post Frequency Distribution Analysis using Python Data Stack – Part 1 appeared first on Dataconomy.

Python 172
article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

Boost Your Data Wrangling with R

Dataconomy

The R language is often perceived as a language for statisticians and data scientists. Quite a long time ago, this was mostly true. However, over the years the flexibility R provides via packages has made R into a more general purpose language. R was open sourced in 1995, and since. The post Boost Your Data Wrangling with R appeared first on Dataconomy.

article thumbnail

Dare to Share in The Cloud: How Secure Is Your Data?

Dataconomy

The march to the cloud for mission-critical applications is picking up speed. Even financial services firms, noted for their caution, are making headway. UK-based insurance intermediaryTowergate Insurance announced last year that it is moving its IT infrastructure to the cloud. And The Wall Street Journal reported in June 2016 that. The post Dare to Share in The Cloud: How Secure Is Your Data?

Big Data 157
article thumbnail

Amazon Kinesis vs. Apache Kafka For Big Data Analysis

Dataconomy

Data processing today is done in form of pipelines which include various steps like aggregation, sanitization, filtering and finally generating insights by applying various statistical models. Amazon Kinesis is a platform to build pipelines for streaming data at the scale of terabytes per hour. Parts of the Kinesis platform are. The post Amazon Kinesis vs.

article thumbnail

Keep it real?—?say no to algorithm porn!

Dataconomy

For people in the know, machine learning is old hat. Even so, it’s set to become the data buzzword of the year — for a rather mundane reason. When things get complex, people expect technology to ‘automagically’ solve the problem. Whether it’s automated financial product consultation or shopping in the supermarket of. The post Keep it real — say no to algorithm porn!

article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.