August, 2022

article thumbnail

7 Techniques to Handle Imbalanced Data

KDnuggets

This blog post introduces seven techniques that are commonly applied in domains like intrusion detection or real-time bidding, because the datasets are often extremely imbalanced.

article thumbnail

Building a simple Flask App using Docker vs Code

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction More often than not, developers run into issues of an application running on one machine versus not running on another. Dockers help prevent this by ensuring the application runs on any machine if it works on yours. Simply put, if your job as […]. The post Building a simple Flask App using Docker vs Code appeared first on Analytics Vidhya.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

The best online master’s degrees in AI for you to apply

Dataconomy

What are the best masters in artificial intelligence online? The AI degree is a relatively new academic pursuit as a subfield of machine learning. Several artificial intelligence applications can be implemented using the advanced programming skills that students will master with this degree. Engineering and other topics are frequently covered.

article thumbnail

$60/MWh for advanced nuclear electricity is achievable, says GE Hitachi executive

Hacker News

At that price, there would be demand for about 90 GW of emerging small reactors in the U.S. by 2050, according to a recent Nuclear Energy Institute survey.

165
165
article thumbnail

Navigating the Future: Generative AI, Application Analytics, and Data

Generative AI is upending the way product developers & end-users alike are interacting with data. Despite the potential of AI, many are left with questions about the future of product development: How will AI impact my business and contribute to its success? What can product managers and developers expect in the future with the widespread adoption of AI?

article thumbnail

Mapping how far you can travel by train in five hours, from any European station

FlowingData

This European travel map by Benjamin Td shows how far you can travel in five hours, given a station location. Just hover over the map, and you see the areas, or isochrones that are reachable in five hours, assuming 20 minutes for interchanges. The project is based on data from Deutsch Bahn, and was inspired by a more dotty map by Julius Tens. It reminds me of Tom Carden’s (now Flash-retired) travel time map from 2008.

139
139
article thumbnail

7 Enterprise Applications for Companies Using Cloud Technology

Smart Data Collective

The market for cloud technology is booming. Companies spent over $405 billion on cloud services last year. The sudden growth is not surprising, because the benefits of the cloud are incredible. Enterprise cloud technology applications are the future industry standard for corporations. Cloud computing has found its way into many business scenarios and is a relatively new concept for businesses.

More Trending

article thumbnail

Introduction to Requests Library in Python

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Requests in Python is a module that can be used to send all kinds of HTTP requests. It is straightforward to use and is a human-friendly HTTP Library. Using the requests library; we do not need to manually add the query string […]. The post Introduction to Requests Library in Python appeared first on Analytics Vidhya.

Python 400
article thumbnail

Machine learning makes life easier for data scientists

Dataconomy

The much-awaited comparison is finally here: machine learning vs data science. The terms “data science” and “machine learning” are among the most popular terms in the industry in the twenty-first century. These two methods are being used by everyone, from first-year computer science students to large organizations like Netflix and.

article thumbnail

On Technique

O'Reilly Media

In a previous article , I wrote about how models like DALL-E and Imagen disassociate ideas from technique. In the past, if you had a good idea in any field, you could only realize that idea if you had the craftsmanship and technique to back it up. With DALL-E, that’s no longer true. You can say, “Make me a picture of a lion attacking a horse,” and it will happily generate one.

Python 106
article thumbnail

Introduction to Probability for Data Science, a free book

FlowingData

Introduction to Probability for Data Science is a free-to-download book by Purdue statistics professor Stanley H. Chan: We need a book that balances the theory and practice. We need a book that provides insights and not just theorems and proofs. We need a book that motivates the students, telling them why probability is so essential to their work. We need a book that highlights the impacts of the subject.

article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

5 Ways B2B Companies Can Use Analytics for Pricing

Smart Data Collective

Analytics technology is very important for modern business. Companies spent over $240 billion on big data analytics last year. That figure is expected to grow as more businesses discover its benefits. There are many important applications of data analytics technology. One of the most important is with helping companies set their prices correctly. Analytics Can Be Essential for Helping Companies with their Pricing Strategies.

Analytics 129
article thumbnail

Data Transformation: Standardization vs Normalization

KDnuggets

Increasing accuracy in your models is often obtained through the first steps of data transformations. This guide explains the difference between the key feature scaling methods of standardization and normalization, and demonstrates when and how to apply each approach.

article thumbnail

Most Frequently Asked NLP Interview Questions

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Natural language processing (NLP) is the branch of computer science and, more specifically, the domain of artificial intelligence (AI) that focuses on providing computers the ability to understand written and spoken language in a way similar to that of humans. Combining computational linguistics […].

article thumbnail

The data governance framework is an indispensable compass of the digital age

Dataconomy

To achieve business results, all businesses must establish a data governance framework that ensures that data is treated similarly across the organization. Without effective data governance, tracking when and from where erroneous data enters your systems and who is utilizing it is impossible.

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

Text AI Updates Drive Faster Business Value

DataRobot Blog

How can you save time in understanding the impact of language when working with text in ML models ? With tens of thousands of Text AI projects, DataRobot has helped organizations unlock insights from text and generate predictions with text models—from assisting with customer support ticket triage to predicting real estate sale prices. Continuing to build on previously released Text AI capabilities, DataRobot AI Cloud introduces new features to help with language detection, blueprint optimization

AI 105
article thumbnail

Finding illegal airstrips in Brazil

FlowingData

Using a combination of satellite imagery, crowdsourced databases, and analyses, The New York Times identified airstrips used for illegal mining in Brazil : To confirm these locations and connect them with illicit mining, Times reporters built a tool to help analyze thousands of satellite images. They examined historical satellite imagery to determine that 1,269 unregistered airstrips still appeared in active use within the past year.

Database 124
article thumbnail

The Future of AI: High Quality, Human Powered Data

Smart Data Collective

Artificial Intelligence (AI) has significantly altered how work is done. However, AI even has a bigger impact by enhancing human capabilities. Research conducted by the Harvard Business Review found that the interaction between machines and humans significantly improves firms’ performance. Successful collaboration between humans and machines enhances each other’s strengths, including teamwork, leadership, creativity, speed, scalability, and quantitative capabilities.

article thumbnail

The Importance of Experiment Design in Data Science

KDnuggets

Do you feel overwhelmed by the sheer number of ideas that you could try while building a machine learning pipeline? You can not take the liberty of trying all possible ways to arrive at a solution - hence we discuss the importance of experiment design in data science projects.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Multi-variate Time Series Forecasting using Kats Model

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction Kats model-which is also developed by Facebook Research Team-supports the functionality of multi-variate time-series forecasting in addition to univariate time-series forecasting. Often we need to forecast a time series where we have input variables in addition to ‘time’; this is where the […].

article thumbnail

AI in the manufacturing market will rise by 14 billion dollars in 5 years (Learn why)

Dataconomy

What is the role of artificial intelligence in manufacturing? Well, there are a lot of use cases for artificial intelligence in everyday life, but what about AI in manufacturing? The effects of artificial intelligence in business heavily include manufacturing. Are you scared of AI jargon? We have already created a detailed AI glossary for the.

article thumbnail

Using Data Analytics to Understand Gross Margin Attribution

Dataversity

Companies across sectors and of all sizes face a common problem: understanding their profit margins and identifying the levers that are attributing to those margins. It’s as fundamental to business operations as you can get – if the margin isn’t there, you’re not going to have a viable business, and in an increasingly data-driven world, businesses that […].

article thumbnail

Most notable person, everywhere in the world

FlowingData

Who is the most famous person born in the place you live? This interactive map by Topi Tjukanov lets you answer that question for anywhere in the world. The pool of possible people comes from a cross-verified database of 2.29 million people, based on Wikipedia entries and Wikidata. You can also see the most notable person per category: culture, science, leadership, and sports.

Database 102
article thumbnail

Manufacturing Sustainability Surge: Your Guide to Data-Driven Energy Optimization & Decarbonization

Speaker: Kevin Kai Wong, President of Emergent Energy Solutions

In today's industrial landscape, the pursuit of sustainable energy optimization and decarbonization has become paramount. Manufacturing corporations across the U.S. are facing the urgent need to align with decarbonization goals while enhancing efficiency and productivity. Unfortunately, the lack of comprehensive energy data poses a significant challenge for manufacturing managers striving to meet their targets.

article thumbnail

What is the IoT and How is it Changing the World?

Smart Data Collective

The Internet of Things (IoT) has been on the rise in recent years, and it’s becoming more and more common among consumers, businesses, and governments alike. IoT refers to any connected physical device that can send or receive data over the internet, including smartphones, computers, speakers, security cameras, thermostats, door locks, vehicles—the list goes on and on.

article thumbnail

How Do Data Scientists and Data Engineers Work Together?

KDnuggets

If you’re considering a career in data science, it’s important to understand how these two fields differ, and which one might be more appropriate for someone with your skills and interests.

article thumbnail

Comprehensive Guide to Edge Detection Algorithms

Analytics Vidhya

Introduction Image processing is a widely used concept to exploit the information from the images. Image processing algorithms take a long time to process the data because of the large images and the amount of information available in it. So, in these edge-cutting techniques, it is necessary to reduce the amount of information that the […]. The post Comprehensive Guide to Edge Detection Algorithms appeared first on Analytics Vidhya.

Algorithm 398
article thumbnail

PaddlePaddle deep learning framework expands AI to industrial applications

Dataconomy

PaddlePaddle has recently received new updates from Baidu, along with 10 large deep learning models covering computational biology, vision, and natural language processing. Despite the growing popularity of more recent innovations, TensorFlow, PyTorch, and Keras remain the three deep-learning frameworks that have long dominated AI. The most widely used Chinese.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

How to Implement a Data Quality Framework

Dataversity

According to IDC, 30-50% of businesses experience gaps between their data expectations and reality. They have the data they need, but due to the presence of intolerable defects, they cannot use it as needed. These defects – also called Data Quality issues – must be fetched and fixed so that data can be used for successful business […]. The post How to Implement a Data Quality Framework appeared first on DATAVERSITY.

article thumbnail

AI in Supply Chain — A Trillion Dollar Opportunity

DataRobot Blog

Supply chain and logistics industries worldwide lose over $1 trillion a year due to out-of-stock or overstocked items 1. Shifting demands and shipping difficulties make the situation worse. Challenges in inventory management, demand forecasting, price optimization, and more can result in missed opportunities and lost revenue. The retail marketplace has become increasingly complex and competitive.

AI 98
article thumbnail

How AI Software is Changing the Future of the Automotive Industry

Smart Data Collective

Artificial intelligence technology is changing the future of many industries. Global companies spent over $328 billion on AI last year. This figure is expected to grow as more companies recognize the potential and decide to increase the resources they dedicate to machine learning and predictive analytics tools. The automotive industry is among those investing in AI the most.

AI 124
article thumbnail

Most In-demand Artificial Intelligence Skills To Learn In 2022

KDnuggets

Artificial Intelligence (AI) is the process of programming a computer that can reason and learn like a human being and make decisions for itself.

article thumbnail

How Embedded Analytics Gets You to Market Faster with a SAAS Offering

Start-ups & SMBs launching products quickly must bundle dashboards, reports, & self-service analytics into apps. Customers expect rapid value from your product (time-to-value), data security, and access to advanced capabilities. Traditional Business Intelligence (BI) tools can provide valuable data analysis capabilities, but they have a barrier to entry that can stop small and midsize businesses from capitalizing on them.