2015

article thumbnail

Most HR Data Is Bad Data

Hacker News

'We are sorry. This site is optimized for Internet Explorer 9 and above. Please update your browser. We are sorry. This site is optimized for Safari 7 and above. Please update your browser. 4 / 5 Free Articles left Remaining. Register for more. |. Subscribe + Save! Register. Sign In. Sign In. Your cart. Your Shopping Cart is empty. Visit our store. Go to Cart.

Analytics 140
article thumbnail

DataScience SG Meetup - How we got top 3% in Kaggle

Eugene Yan

Sharing about my first data science competition at DataScience SG.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

My company just turned 10 and it was quite a ride

Christian Haschek

Hello readers!

52
article thumbnail

I fell for self-help so you don’t have to

Ines Montani

I hate to admit it, but during my late teenage years and the beginning of my adult life, I massively fell for self-help blogs. I was unhappy, didn’t particularly like myself and had no idea where I would end up one day so I started spending hours on the internet looking for “answers”. (Yeah, this probably sounds familiar.) I was never particularly interested in conventional luxuries - think fancy hotels, designer s**t, glitter and cupcakes - but for a while I believed that this is what I needed

AWS 52
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

Introducing spaCy

Explosion

spaCy is a new library for text processing in Python and Cython. I wrote it because I think small companies are terrible at natural language processing (NLP). Or rather: small companies are using terrible NLP technology. This post shows the original launch announcement for spaCy , which came with some usage examples and benchmarks. The benchmarks are quite out of date, but I’m pleased to say usage has changed relatively little.

article thumbnail

DataKind Singapore’s Latest Project Accelerator

Eugene Yan

Guest post of how DataKind SG worked with NGOs to frame their problems and suggests solutions

100
100

More Trending

article thumbnail

What would happen if you'd embed a remote JPEG with a HTTP link but the remote server is forcing HTTPS?

Christian Haschek

A few months ago I started going HTTPS for all my sites.

52
article thumbnail

Raspberry Pi controlled power sockets

Christian Haschek

Disclaimer: I'm not an electrician and I would not use this proof-of-concept in reallife because

40
article thumbnail

That (not so) awesome time the police raided my home

Christian Haschek

A few years ago I saw the blooming of new IRC communities.

40
article thumbnail

Water cooled NAS made from spare parts

Christian Haschek

In this project I will build a water cooled NAS made from [my 51$ water cooling solution](https:

40
article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

DIY: Cheap wall mounted, water cooled PC for 51$

Christian Haschek

[UPDATE] Test results are in!

40
article thumbnail

Let's analyze over twenty thousand proxies

Christian Haschek

In my last post I tested 443 proxies and analyzed how safe

40
article thumbnail

What would happen if you'd embed a remote JPEG with a HTTP link but the remote server is forcing HTTPS? -

Christian Haschek

A few months ago I started going HTTPS for all my sites. One of the sites I was particularly excited.

40
article thumbnail

Neue Reifeprüfung analyzed

Christian Haschek

This year was a special year for many teachers in Austria.

40
article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

Repairing a closed loop CPU water cooler with aquarium tubing

Christian Haschek

I had this broken KÜHLER H2O 620 clos

40
article thumbnail

Raspberry Pi 2 wireless temperature box

Christian Haschek

Two years ago I made a project where I used a Raspberry Pi

40
article thumbnail

Introducing spaCy

Explosion

Computers don't understand text. This is unfortunate, because that's what the web almost entirely consists of. We want to recommend people text based on other text they liked. We want to shorten text to display it on a mobile screen. We want to aggregate it, link it, filter it, categorise it, generate it and correct it. spaCy provides a library of utility functions that help programmers build such products.

40
article thumbnail

How spaCy Works

Explosion

This post was pushed out in a hurry, immediately after spaCy was released. It explains some of how spaCy is designed and implemented, and provides some quick notes explaining which algorithms were used. The post pre-dates spaCy's named entity recogniser, but it provides some detail about the tokenisation algorithm, general design, and efficiency concerns.

article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

article thumbnail

How spaCy Works

Explosion

The following are some hasty preliminary notes on how spaCy works. The short story is, there are no new killer algorithms. The way that the tokenizer works is novel and a bit neat, and the parser has a new feature set, but otherwise the key algorithms are well known in the recent literature. This post was pushed out in a hurry, immediately after spaCy was released.

article thumbnail

Chromebooks for Work & Classrooms

Christian Haschek

When I first heard about Chromebooks a few years ago, I thought the Idea is awesome but I didn't

40
article thumbnail

5 things I learned from working in print

Ines Montani

For four years, I worked for a popular print magazine, developing marketing strategies, selling advertising space and coordinating partnerships. Being a monthly publication, we often jokingly compared it to giving birth once a month: at first you get all excited, then it becomes super hard and you want to die and then all of a sudden it’s over, you’re happy and hold the result in your hands and already forgot about all of the pain.

52
article thumbnail

Dead Code Should Be Buried

Explosion

Natural Language Processing moves fast, so maintaining a good library means constantly throwing things away. Most libraries are failing badly at this, as academics hate to editorialize. This post explains the problem, why it's so damaging, and why I wrote spaCy to do things differently.

article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?

article thumbnail

Dead Code Should Be Buried

Explosion

Natural Language Processing moves fast, so maintaining a good library means constantly throwing things away. Most libraries are failing badly at this, as academics hate to editorialize. This post explains the problem, why it’s so damaging, and why I wrote spaCy to do things differently. Imagine: you try to use Google Translate, but it asks you to first select which model you want.