Sony Pixel Power calrec Sony

Rachel Bittner on Basic Pitch: An Open Source Tool for Musicians

01/09/2022

Music creation has never been as accessible as it is now. Gone are the days of classical composers, sheet music, and prohibitively expensive studio time when only trained, bankrolled musicians had the opportunity to transcribe notes onto a page. As technology has changed, so too has the art of music creation-and today it is easier than ever for experts and novices alike to compose, produce, and distribute music.

Now, musicians use a computer-based digital standard called MIDI (pronounced MID-ee ). MIDI acts like sheet music for computers, describing which notes are played and when-in a format that's easy to edit. But creating music from scratch, even using MIDI, can still be very tedious. If you play piano and have a MIDI keyboard, you can create MIDI by playing. But if you don't, you must create it manually: note by note, click by click.

To help solve this problem, Spotify's machine learning experts trained a neural network to predict MIDI note events when given audio input. The network is packaged in a tool called Basic Pitch, which we just released as an open source project.

Basic Pitch makes it easier for musicians to create MIDI from acoustic instruments-for example, by singing their ideas, says Rachel Bittner, a research manager at Spotify who is focused on applied machine learning on audio. It can also give musicians a quick starting point' transcription instead of having to write down everything manually, saving them time and resources. Basically, it allows musicians to compose on the instrument they want to compose on. They can jam on their ukulele, record it on their phone, then use Basic Pitch to turn that recording into MIDI. So we've made MIDI, this standard that's been around for decades, more accessible to more creators. We hope this saves them time and effort while also allowing them to be more expressive and spontaneous.

For the Record asked Rachel to tell us more about the thinking and development that go into Basic Pitch and other machine learning efforts, and how the team decided to open up the tool for anyone to access and to innovate on.

Help us understand the basics. How are machine learning models being applied to audio? Rachel Bittner

On the audio ML (machine learning) teams at Spotify, we build neural networks-like the ones that are used to recognize images or understand language-but ours are designed specifically for audio. Similar to how you ask your voice assistant to identify the words you're saying and also make sense of the meaning behind those words, we're using neural networks to understand and process audio in music and podcasts. This work combines our ML research and practices with domain knowledge about audio-understanding the fundamentals of how music works, like pitch, tone, tempo, the frequencies of different instruments, and more.

What are some examples of machine learning projects you're working on that align with our mission to give a million creators the opportunity to live off their art ? Spotify enables creators to reach listeners and listeners to discover new creators. A lot of our work helps with this in indirect ways-for example, identifying tracks that might go well together on a playlist because they share similar sonic qualities like instrumentation or recording style. Maybe one track is already a listener's favorite and the other one is something new they might like.

We also build tools that help creative artists actually create. Some of our tech is in Soundtrap, Spotify's digital audio workstation (DAW), which is used to produce music and podcasts. It's like having a complete studio online. And then there's Basic Pitch, which is a stand-alone tool for converting audio into MIDI that we just released as an open source project. We open sourced Basic Pitch and built an online demo, so anyone can use it to translate musical notes in a recording (including voice, guitar, or piano).

Unlike similar ML models, Basic Pitch is not only versatile and accurate at doing this, but it's also fast and computationally lightweight. So the musician doesn't have to sit around forever waiting for their recording to process. And on the technological and environmental side, it uses way less energy-we're talking orders of magnitude less-compared to other ML models. We named the project Basic Pitch because it can also detect pitch bends in the notes, which is a particularly tricky problem for this kind of model. But also because the model itself is so lightweight and fast.

What else makes Basic Pitch a unique machine learning project for Spotify? I mentioned before how computationally lightweight it is-that's a good thing. In my opinion, the ML industry tends to overlook the environmental and energy impact of their models. Usually with ML models like this-whether it's for processing images, audio, or text-you throw as much processing power as you can at the problem as the default method for reaching some level of accuracy. But from the beginning, we had a different approach in mind: We wanted to see if we could build a model that was both accurate and efficient, and if you have that mindset from the start, it changes the technical decisions you make in how you build the model. Not only is our model as accurate as (or even more accurate than) similar models, but since it's lightweight, it's also faster, which is better for the user, too.

What's the benefit of open sourcing this tool? It gives more people access to it since anyone with a web browser can use the online demo. Plus, we believe the external contributions from the open source community help it evolve as software to create a better, more useful product for everyone. For example, while we believe Basic Pitch solves an important problem, the quality of the MIDI that our system (and others') pro
LINK: https://newsroom.spotify.com/2022-09-01/rachel-bittner-on-basic-pitch-...
See more stories from spotify

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

27/06/2026

Through Their Lens: What Cinematographer Amy Vincent Saw at the 2026 Directors Lab

There's no doubt that you've seen the world through Amy Vincent's ey...

27/06/2026

UJAM release Retrocraft

Brings together saturation & lo-fi effects Following on from the release of their Voxcraft vocal-processing plug-in, UJAM have announced the launch of Retro...

27/06/2026

A record 4.84 million Australians choose SBS as the Socceroos advance at FIFA World Cup 2026

A record 4.84 million Australians choose SBS as the Socceroos advance at FIFA Wo...

27/06/2026

Apogee CRAS Symphony Mkii Education Feature Blog

Why CRAS Upgraded to Symphony I/O MK II When an audio school runs studios all day, every day, gear doesn't just need to sound good , it needs to survive rea...

27/06/2026

MultiDyne Acquires the Assets of MRMC

Share Copy link Facebook X Linkedin Bluesky Email...

27/06/2026

Spectrum Intelligence Ventures Launches Latis

Share Copy link Facebook X Linkedin Bluesky Email...

27/06/2026

Krotos Video to Sound Plugin Now Available for Adobe Premiere Pro

Krotos Video to Sound Plugin Now Available for Adobe Premiere Pro Brie Clayton June 26, 2026 0 Comments Editors can analyze footage, generate synchron...

27/06/2026

Mirai Media Elevates Digital and Broadcast Productions with Blackmagic Design

Mirai Media Elevates Digital and Broadcast Productions with Blackmagic Design Brie Clayton June 26, 2026 0 Comments Studio uses Ultimatte 12 HD and Po...

27/06/2026

Lutra Cafe & Bakery Opens At American Tobacco Campus

DURHAM, N.C. - JUNE 26, 2026 - Lutra Cafe & Bakery has opened its first brick-and-mortar location at American Tobacco Campus after owner Chris McLaurin operated...

26/06/2026

SVG GameDay, Ep. 21: Minnesota Vikings Allan Wertheimer - Large-Scale Shows in Minny

In-venue and creative video staffers at the professional and collegiate level ha...

26/06/2026

Strike Fighter League Announces Second Online Tournament, Set for July 25 in Las Vegas

Strike Fighter League (SFL), a professional air combat digital sport combining f...

26/06/2026

InfoComm 2026: Wisycom Announces MPR60 Firmware Update, MATF Antenna Matrix, and PFL RFoF Box

Wisycom has announced three new additions to its professional wireless ecosystem...

26/06/2026

Eurovision Services Inaugurates Expanded Master Control Room in Madrid

Eurovision Services inaugurated an expanded Master Control Room (MCR) in Madrid on June 1, 2026, building on a broadcast hub the company has operated in the cit...

26/06/2026

Midco Sports and University of North Dakota Renew Broadcast and Sponsorship Partnership

Midco Sports and the University of North Dakota (UND) have announced a two-year ...

26/06/2026

G&D and VuWall Appoint Vutec as Exclusive South Africa Distributor

Guntermann and Drunck (G&D) and VuWall, both part of the Panoptec Technologies Group, have appointed Vutec (Pty) Ltd as exclusive distributor for their KVM and ...

26/06/2026

Visit Seattle Launches Drone Scoreboard at Space Needle for FIFA World Cup 2026

Visit Seattle, the official destination marketing organization for Seattle and King County, has launched what it describes as the world's first drone scoreb...

26/06/2026

CP Communications Provides RF and Wireless Support for 2026 NBA Draft at Barclays Center

CP Communications provided RF video, audio, and crew communications support for ...

26/06/2026

Reimagined MoonPay X Games League Kicks Off With Three-Day Event in Sacramento

Produced by longtime partner Echo Entertainment, the action-sports property is now a team-based year-round league The inaugural season of the MoonPay X Games L...

26/06/2026

MultiDyne Acquires MRMC, Expands into Camera Robotics and Motion Control

The deal establishes MultiDyne Robotics and Motion Control, maintaining the well-known MRMC brand.MultiDyne Video & Fiber Optic Systems has acquired the assets ...

26/06/2026

TNT Sports Heads Into Year 2 of NASCAR Return With New NEP Truck, Expanded In-Car Experience

PX1 will debut at Sonoma as TNT leans into super-slo-mo, drones, SMT data integr...

26/06/2026

Ratings Roundup: USMNT-Australia Draws 23M Viewers; Mexico-South Korea Is Most-Watched Spanish-Language Soccer Match Ever

Ratings Roundup is a rundown of recent rating news and is derived from press rel...

26/06/2026

David Kuckhermann brings calabash to Celemony Tonalic

Virtual session musician plug-in gains new percussion options Celemony's latest update for their virtual session musician platform complements the exist...

26/06/2026

Softube unveil the Console 1 Compact

Half-size model joins Console 1 line-up Shortly after the release of their new Flow Studio controller, Softube have announced the launch of another new surf...

26/06/2026

ELT Group and Rohde & Schwarz sign a cooperation agreement to explore commercial opportunities in electromagnetic warfare and defense

ELT Group and Rohde & Schwarz sign a cooperation agreement to explore commercial...

26/06/2026

Lightware Powers Teddy Swims UK And Europe Tour With Adva...

For Teddy Swims sold-out I've Tried Everything But Therapy tour, event technology specialists, PRG, provided video, automation and lighting across 19 date...

26/06/2026

Taurus TPN powers AV workflows at NurnbergMesse

Modern exhibition and event venues face the challenge of seamlessly integrating traditional conference technology, professional broadcast workflows and IP-based...

26/06/2026

FCC Adopts New Cybersecurity Requirements for Alerting Systems

Share Copy link Facebook X Linkedin Bluesky Email...

26/06/2026

Study: Roku Most Used But Not Highest Rated Streaming Platform

Share Copy link Facebook X Linkedin Bluesky Email...

26/06/2026

Samsung Ads Announces First Shoppable CTV Partners

Share Copy link Facebook X Linkedin Bluesky Email...

26/06/2026

Gray Media Names Annie Cordell General Manager of WMBF

Share Copy link Facebook X Linkedin Bluesky Email...

26/06/2026

Neko Oji: The Guy That Got Reincarnated as a Cat Edited with DaVinci Resolve Studio

Neko Oji: The Guy That Got Reincarnated as a Cat Edited with DaVinci Resolve Stu...

26/06/2026

Adobe to Acquire Topaz Labs

Adobe to Acquire Topaz Labs Brie Clayton June 25, 2026 0 Comments Adobe has seen strong demand for its AI products for creatives, including Adobe Fire...

26/06/2026

Berklee Students Earn Dedicated Section at Raindance Film Festival in London

Berklee Students Earn Dedicated Section at Raindance Film Festival in London Five documentary short films produced in the Africana Studies Department screen a...

26/06/2026

Keeping Pace with the Race

How IMS Productions and FOX Sports scaled coverage of the 109th Indianapolis 500. The last lap of this year's Indianapolis 500 delivered the kind of ending...

26/06/2026

Prison Wives of TikTok is Locked In for U and U&W

Flicker Productions to produce five-part docu-reality series following women who have fallen for men in prison and have become TikTok sensations, with brands an...

26/06/2026

Automating post-production workflows with Baselight, Daylight, Nara & FilmLight API. New York. 8 July 2026

Catch up on the latest developments across Baselight and Daylight v7, Nara and F...

26/06/2026

DFT installs second Polar HQ at China News Film Confirming Position as China's Leading 8K Film Preservation Partner

26. June 2026 News DFT is pleased to announce that a second Polar HQ film s...

26/06/2026

New documentary Freedom Founder: Thomas McKean and the American Revolution comes to RT

New documentary Freedom Founder: Thomas McKean and the American Revolution airs ...

25/06/2026

Launching a Career in Broadcast Engineering: Academic Paths and Essential Certifications

Launching a Career in Broadcast Engineering: Academic Paths and Essential Certif...

25/06/2026

SVG Students To Watch: Jude Kieffer, Ball State University

This superstar shooter/storyteller from Central Indiana hopes to make his mark in the blossoming sports-documentary and -features space In the live-sports-vid...

25/06/2026

Presidio and NHL Renew Multiyear North American Technology Partnership

Presidio and the National Hockey League have announced a multiyear renewal of their North American partnership. Presidio will remain an Official Technology Inno...

25/06/2026

Strike Fighter League Hits the Industry as First Professional Air Combat Sport

Strike Fighter League (SFL) is the world's first professional air combat digital sport that combines elite human performance and physical immersion with cut...

25/06/2026

Rise Reveals 2026 Worldwide Mentoring Cohorts to Support Future Industry Leaders

Rise, the award-winning advocacy group for gender diversity in the broadcast and media technology sector, is pleased to announce the global mentoring cohort for...

25/06/2026

MLB Network To Air American Association of Professional Baseball All-Star Game for First Time on July 15

The 2026 American Association of Professional Baseball (AAPB) All-Star Game will...

25/06/2026

Mediaproxy Partners with HVS for U.S. Broadcast Market

Mediaproxy has named Heartland Video Systems (HVS) as its exclusive partner for US television broadcasting. The Wisconsin-based systems integrator will represen...

25/06/2026

Backblaze Inks Five-Year Multi-Exabyte Data Storage Agreement with CoreWeave

Backblaze has formed an agreement with CoreWeave to create The Essential Cloud for AI. Under the multi-exabyte, $335 million agreement, Backblaze will provide...