Remove audio-amplifier
article thumbnail

Stable Video Diffusion

Hacker News

Spanning across modalities including image, language, audio, 3D, and code, our portfolio is a testament to Stability AI’s dedication to amplifying human intelligence. Stable Video Diffusion is a proud addition to our diverse range of open-source models.

182
182
article thumbnail

Build Your Own Hi-fi Ear Defenders

Hacker News

For years, I have been trying to improve my personal audio-monitoring situation without going to the expense of the systems used by professional touring bands, which include custom-molded earpieces. Also mounted on the board are the ESP32 microcontroller (5) and a volume control and audio jack (6). James Provost I’m a drummer.

179
179
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Two C64s Plus a Pile of Floppy Disks Equals One Accordion

Hacker News

One supporting board incorporates a microcontroller to measure the airflow and mix the audio signals, a second stores the accordion software and emulates a cassette player, and a third acts as a power hub. Air flowing into or out of the hole passes over the microphone, and the resulting turbulence turns into audio noise. James Provost.

171
171
article thumbnail

Google Gemini: The AI model by Google

Towards AI

It’s a natively multimodal AI, designed to seamlessly process text, images, audio, and code. Gemini Ultra showcases prowess in diverse areas: from mathematics to code generation, image and video understanding, and audio processing. Gemini’s integration into Google’s unified AI stack unlocks numerous opportunities.

AI 98
article thumbnail

Google Gemini: The AI model by Google

Towards AI

It’s a natively multimodal AI, designed to seamlessly process text, images, audio, and code. Gemini Ultra showcases prowess in diverse areas: from mathematics to code generation, image and video understanding, and audio processing. Gemini’s integration into Google’s unified AI stack unlocks numerous opportunities.

AI 93
article thumbnail

Toolify review: The popular AI tools directory

Dataconomy

Text-to-speech expands possibilities for audio-first content and personalized voice interfaces. Uberduck – Web tool for cloning voices or applying vocal effects like autotune to audio clips. These voice manipulation capabilities open new creative horizons for audio, entertainment, personalization, and more.

AI 113
article thumbnail

Home Alone 3 Kevin McCallister trailer “directed” by AI and hit the all right notes

Dataconomy

Guns replace paint cans, explosions amplify screams. It essentially involves manipulating existing audio or video recordings to make it appear as if someone is saying or doing something they never did. These models are trained on large datasets of images, audio, and video to learn the nuances of human appearance and speech.

AI 113