7 High Paying Side Hustles for Data Scientists
KDnuggets
OCTOBER 9, 2023
This article serves as a guide for the data professional who wants to earn more in these trying times.
KDnuggets
OCTOBER 9, 2023
This article serves as a guide for the data professional who wants to earn more in these trying times.
insideBIGDATA
OCTOBER 11, 2023
The team here at insideBIGDATA is deeply entrenched in keeping the pulse of the big data ecosystem of companies from around the globe. We’re in close contact with the movers and shakers making waves in the technology areas of big data, data science, machine learning, AI and deep learning. Our in-box is filled each day with new announcements, commentaries, and insights about what’s driving the success of our industry so we’re in a unique position to publish our quarterly IMPACT 50 List.
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
databricks
OCTOBER 12, 2023
In this blog post, the MosaicML engineering team shares best practices for how to capitalize on popular open source large language models (LLMs).
Analytics Vidhya
OCTOBER 13, 2023
Introduction In today’s ever-advancing world of technology, there’s an exciting development on the horizon – Advanced Multi-modal Generative AI. This cutting-edge technology is about making computers more innovative and great, creating content and understanding. Imagine a digital assistant that seamlessly works with text, images, and sounds and generates information.
Speaker: Tamara Fingerlin, Developer Advocate
Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.
KDnuggets
OCTOBER 9, 2023
Unlock the power of GPT-4 summarization with Chain of Density (CoD), a technique that attempts to balance information density for high-quality summaries.
insideBIGDATA
OCTOBER 10, 2023
In this contributed article, Frank Laura, Chief Technology Officer at EngageSmart (NYSE: ESMT), discusses why CIOs and CTOs need to bring AI into businesses safely, securely, and legally. AI will enable CIOs and their teams to shift focus away from tactical and/or repetitive work towards creating innovative solutions for their teams and customers.
Data Science Current brings together the best content for data science professionals from the widest variety of thought leaders.
Analytics Vidhya
OCTOBER 12, 2023
Introduction In the field of artificial intelligence, Large Language Models (LLMs) and Generative AI models such as OpenAI’s GPT-4, Anthropic’s Claude 2, Meta’s Llama, Falcon, Google’s Palm, etc., have revolutionized the way we solve problems. LLMs use deep learning techniques to perform natural language processing tasks. This article will teach you to build LLM Apps […] The post How to Build LLM Apps Using Vector Database?
KDnuggets
OCTOBER 13, 2023
A new deep learning framework built entirely in Rust that aims to balance flexibility, performance, and ease of use for researchers, ML engineers, and developers.
insideBIGDATA
OCTOBER 7, 2023
In this contributed article, Gordon McKenna, VP of Cloud Evangelist & Alliances at Ensono, discusses the situation with Microsoft strongly backing OpenAI, what can we expect the future to look like? Microsoft’s investment into OpenAI was a clear move for the company to align itself with the next killer app that would drive engagement on Azure cloud.
databricks
OCTOBER 10, 2023
We’re excited to announce that Meta AI’s Llama 2 foundation chat models are available in the Databricks Marketplace for you to fine-tune and dep.
Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage
There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.
Analytics Vidhya
OCTOBER 11, 2023
Analytics Vidhya’s ‘Leading With Data’ is a series of interviews where industry leaders share their experiences, career journeys, interesting projects, and more. In the 5th episode of the series, we are joined by a very special guest – Mr. Srikanth Valamakanni. He is the Group CEO, Co-founder, and Vice-Chairman of Fractal Analytics, one of the […] The post Leading With Data: Building a Data Driven Organization with Srikanth Velamakanni appeared first on Analytics Vidhya.
KDnuggets
OCTOBER 12, 2023
This article talks about several best practices for writing ETLs for building training datasets. It delves into several software engineering techniques and patterns applied to ML.
insideBIGDATA
OCTOBER 13, 2023
In this contributed article, lead systems and DevOps engineer Manish Sharma discusses how platform engineering is a constantly developing field, and the advent of AI will likely accelerate the pace of change. Engineers can prepare for and adapt to the coming shifts by keeping up to date with developments in AI, including the increasing number of available AI tools and their applications for platform engineering.
Data Science Dojo
OCTOBER 11, 2023
In today’s world, technology is evolving at a rapid pace. One of the advanced developments is edge computing. But what exactly is it? And why is it becoming so important? This article will explore edge computing and why it is considered the new frontier in international data science trends. Understanding edge computing Edge computing is a method where data processing happens closer to where it is generated rather than relying on a centralized data-processing warehouse.
Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives
Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri
Analytics Vidhya
OCTOBER 7, 2023
Introduction In the ever-evolving landscape of artificial intelligence, one name has stood out prominently in recent years: transformers. These powerful models have transformed the way we approach generative tasks in AI, pushing the boundaries of what machines can create and imagine. In this article, we will delve into the advanced applications of transformers in generative […] The post Unlocking Creativity with Advanced Transformers in Generative AI appeared first on Analytics Vidhya.
KDnuggets
OCTOBER 9, 2023
In this article, Luis shares with readers his thoughts on the intersection of open source software and machine learning and what the future might bring. Many articles cover how open source software is used by the machine learning community but this post focuses on the similarities between the two areas of practice and what machine learning can and can’t learn from open source software.
databricks
OCTOBER 11, 2023
We are delighted to announce that Databricks Asset Bundles are now in public preview. Bundles, for short, facilitate the adoption of software engineering.
Data Science Dojo
OCTOBER 12, 2023
Data erasure is a software-based process that involves data sanitization or in plain words ‘data wiping’ so that no traces of data remain recoverable. This helps with the prevention of data leakage and the protection of sensitive information like trade secrets, intellectual property, or customer information. By 2025, it is estimated that data will grow up to 175 Zettabytes, and with great data comes great responsibility.
Speaker: Frank Taliano
Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.
Analytics Vidhya
OCTOBER 11, 2023
In a groundbreaking move reshaping the landscape of artificial intelligence, OpenAI has unveiled GPT-4 with vision, aptly named GPT-4V. This new iteration empowers users to harness the combined might of language and visual data. Thus unlocking unprecedented capabilities that promise to revolutionize our interactions with AI. Here, we delve into this latest advancement and explore […] The post OpenAI’s GPT-4V(ision): A Breakthrough in AI’s Multimodal Frontier appeared first on A
KDnuggets
OCTOBER 11, 2023
RNN, Transformers, and BERT are popular NLP techniques with tradeoffs in sequence modeling, parallelization, and pre-training for downstream tasks.
databricks
OCTOBER 9, 2023
We’re excited to announce that Databricks has obtained the International Standards Organization (ISO) 27701 certification as a data processor. This certification reflects our c.
insideBIGDATA
OCTOBER 10, 2023
In this contributed article, Anthony Chong, CEO/Co-Founder of IKASI, discusses the three types of machine learning approaches, the benefits and requirements of each, and offer examples of how organizations are applying these tactics to address real world business challenges.
Speaker: Chris Townsend, VP of Product Marketing, Wellspring
Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?
Analytics Vidhya
OCTOBER 11, 2023
Introduction When we hear data science, the first thing that comes to mind is building a model on notebooks and training the data. But this is not the situation in real-world data science. In the real world, data scientists build models and put them into production. The production environment has a gap between the development, […] The post A MLOps-Enhanced Customer Churn Prediction Project appeared first on Analytics Vidhya.
KDnuggets
OCTOBER 13, 2023
Let’s explore Data Mesh, a modern approach to data architecture that decentralizes data ownership and management.
databricks
OCTOBER 13, 2023
This blog was written in collaboration with David Roberts (Analytics Engineering Manager), Kevin P. Buchan Jr (Assistant Vice President, Analytics), and Yubin Park.
Eugene Yan
OCTOBER 8, 2023
I give one talk a year and in 2023 this is that talk.
Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage
When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m
Analytics Vidhya
OCTOBER 12, 2023
Introduction Machine learning is a highly developing domain of technology at present. This technology allows computer systems to learn and make decisions without technical programming. It has a variety of applications, including recognizing patterns, data analysis, and improving performance over time. This guide on how to learn machine learning online will introduce you to the […] The post How to Learn Machine Learning Online?
KDnuggets
OCTOBER 9, 2023
Unlock the power of time-based data visualization with Pandas as we delve into the art of resampling, turning your data into insightful temporal masterpieces.
databricks
OCTOBER 9, 2023
Written in partnership with Shell. The energy industry is all about physical assets – from terminals, ships and pipelines to refineries and wind f.
insideBIGDATA
OCTOBER 8, 2023
In this video presentation, our good friend Jon Krohn, Co-Founder and Chief Data Scientist at the machine learning company Nebula, is joined by Dr. Allen Downey, renowned author and professor, who shares insights from his upcoming book 'Probably Overthinking It,' breaking down underused techniques like Survival Analysis, explaining common paradoxes, discussing the dynamic Overton Window, and how to be prepared for Black Swan events.
Speaker: Tamara Fingerlin, Developer Advocate
In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!
Let's personalize your content