
Music creation has never been as accessible as it is now. Gone are the days of classical composers, sheet music, and prohibitively expensive studio time when only trained, bankrolled musicians had the opportunity to transcribe notes onto a page. As technology has changed, so too has the art of music creation-and today it is easier than ever for experts and novices alike to compose, produce, and distribute music.
Now, musicians use a computer-based digital standard called MIDI (pronounced MID-ee ). MIDI acts like sheet music for computers, describing which notes are played and when-in a format that's easy to edit. But creating music from scratch, even using MIDI, can still be very tedious. If you play piano and have a MIDI keyboard, you can create MIDI by playing. But if you don't, you must create it manually: note by note, click by click.
To help solve this problem, Spotify's machine learning experts trained a neural network to predict MIDI note events when given audio input. The network is packaged in a tool called Basic Pitch, which we just released as an open source project.
Basic Pitch makes it easier for musicians to create MIDI from acoustic instruments-for example, by singing their ideas, says Rachel Bittner, a research manager at Spotify who is focused on applied machine learning on audio. It can also give musicians a quick starting point' transcription instead of having to write down everything manually, saving them time and resources. Basically, it allows musicians to compose on the instrument they want to compose on. They can jam on their ukulele, record it on their phone, then use Basic Pitch to turn that recording into MIDI. So we've made MIDI, this standard that's been around for decades, more accessible to more creators. We hope this saves them time and effort while also allowing them to be more expressive and spontaneous.
For the Record asked Rachel to tell us more about the thinking and development that go into Basic Pitch and other machine learning efforts, and how the team decided to open up the tool for anyone to access and to innovate on.
Help us understand the basics. How are machine learning models being applied to audio? Rachel Bittner
On the audio ML (machine learning) teams at Spotify, we build neural networks-like the ones that are used to recognize images or understand language-but ours are designed specifically for audio. Similar to how you ask your voice assistant to identify the words you're saying and also make sense of the meaning behind those words, we're using neural networks to understand and process audio in music and podcasts. This work combines our ML research and practices with domain knowledge about audio-understanding the fundamentals of how music works, like pitch, tone, tempo, the frequencies of different instruments, and more.
What are some examples of machine learning projects you're working on that align with our mission to give a million creators the opportunity to live off their art ? Spotify enables creators to reach listeners and listeners to discover new creators. A lot of our work helps with this in indirect ways-for example, identifying tracks that might go well together on a playlist because they share similar sonic qualities like instrumentation or recording style. Maybe one track is already a listener's favorite and the other one is something new they might like.
We also build tools that help creative artists actually create. Some of our tech is in Soundtrap, Spotify's digital audio workstation (DAW), which is used to produce music and podcasts. It's like having a complete studio online. And then there's Basic Pitch, which is a stand-alone tool for converting audio into MIDI that we just released as an open source project. We open sourced Basic Pitch and built an online demo, so anyone can use it to translate musical notes in a recording (including voice, guitar, or piano).
Unlike similar ML models, Basic Pitch is not only versatile and accurate at doing this, but it's also fast and computationally lightweight. So the musician doesn't have to sit around forever waiting for their recording to process. And on the technological and environmental side, it uses way less energy-we're talking orders of magnitude less-compared to other ML models. We named the project Basic Pitch because it can also detect pitch bends in the notes, which is a particularly tricky problem for this kind of model. But also because the model itself is so lightweight and fast.
What else makes Basic Pitch a unique machine learning project for Spotify? I mentioned before how computationally lightweight it is-that's a good thing. In my opinion, the ML industry tends to overlook the environmental and energy impact of their models. Usually with ML models like this-whether it's for processing images, audio, or text-you throw as much processing power as you can at the problem as the default method for reaching some level of accuracy. But from the beginning, we had a different approach in mind: We wanted to see if we could build a model that was both accurate and efficient, and if you have that mindset from the start, it changes the technical decisions you make in how you build the model. Not only is our model as accurate as (or even more accurate than) similar models, but since it's lightweight, it's also faster, which is better for the user, too.
What's the benefit of open sourcing this tool? It gives more people access to it since anyone with a web browser can use the online demo. Plus, we believe the external contributions from the open source community help it evolve as software to create a better, more useful product for everyone. For example, while we believe Basic Pitch solves an important problem, the quality of the MIDI that our system (and others') pro
Most recent headlines
06/10/2025
France T l visions, France's leading broadcaster, has received the 2025 EBU ...
06/09/2025
(L-R) Dylan O'Brien and James Sweeney attend the 2025 Sundance Film Festival Twinless premiere at Eccles Theatre. (Photo by George Pimentel/Shutterstock f...
06/09/2025
LONDON Vizrt has introduced Viz Arena 6, the newest version of its all-in-one live augmented reality (AR) graphics and virtual advertising sports solution. The ...
06/09/2025
SEATTLE Amazon Web Services (AWS) will feature 56 AWS Partners making various demos that showcase the technologies and use cases shaping the future of the Media...
06/09/2025
In news that highlights the ongoing importance of video games, PBS Kids is making its first foray into gameplay content with the September 5 launch of Odd Squa...
06/09/2025
PBS chief executive Paula Kerger has sent an email to staff outlining plans to cut about 100 positions or 15% of its staff, following the loss of Federal fundin...
06/09/2025
A new era of recognition as IABM honors the people, projects and innovations driving real impact
IABM has confirmed the shortlist for the new IABM Impact Award...
06/09/2025
Vizrt, the leader in live production technology revolutionizing viewer experiences, announces new capabilities to help customers create once, adapt automaticall...
06/09/2025
Studio Technologies, a leading manufacturer of high-quality audio, video, and fiber-optic solutions, will spotlight four of its innovative audio solutions at th...
06/09/2025
DigitalGlue, creator of the award-winning creative.space managed storage platform, today announced a technology preview of Creative Intelligence (CI) powered by...
06/09/2025
Eye Filmmuseum is the Netherlands leading film museum, offering a rich variety of experiences from screenings of classic films to cultural exhibitions. With fou...
06/09/2025
The latest innovations in Grass Valley's AMPP applications will be on full display at IBC 2025, as the company brings significant updates to Playout X, its ...
06/09/2025
Disguise has announced the launch of the GX 3 , its most powerful media server ever. Built on NVIDIA's cutting-edge Blackwell GPU architecture and including...
06/09/2025
Autoscript and Autocue have announced the launch of a new advanced PTZ prompter system shared by both brands and designed to provide seamless, professional prom...
06/09/2025
Sachtler has added four new models to its award-winning aktiv and FSB Mk II fluid head ranges. The aktiv16T and aktiv18T, plus the FSB 16T Mk II and FSB 18T Mk...
06/09/2025
CGI (TSX: GIB.A) (NYSE: GIB), one of the largest independent IT and business consulting services firms in the world, will present its bold new vision for the fu...
06/09/2025
DHD has chosen IBC 2025 (Amsterdam, 12-15 September) as the launch venue for the latest version of its RM1 all-in-one portable audio production and broadcast sy...
06/09/2025
AJA Video Systems debuted IP25-R, a Mini-Converter for connecting SMPTE ST 2110 networks with 4K SDI/HDMI infrastructures. IP25-R lets broadcast, production, an...
06/09/2025
AJA Video Systems today introduced new products and updates ahead of the International Broadcasting Convention (IBC) 2025 that streamline signal flow management...
06/09/2025
Researchers map key human proteins that power coronavirus replication, pointing to new treatment strategies Findings from Scripps Research reveal promising drug...
05/09/2025
Your listening habits are as unique as you are-and this year, Spotify has introduced a wave of new features to help you personalize your experience. From playli...
05/09/2025
SBS kicks off a confident slate with the 2026 FIFA World Cup 2026 , premium dram...
05/09/2025
Today's Historic Settlement Underscores SBS's Powerful New Series The Pe...
05/09/2025
L3Harris Chief Financial Officer and Aerojet Rocketdyne President Ken Bedingfiel...
05/09/2025
PHILADELPHIA NBC Sports will present tonight's NFL Kickoff Game between the '25 Super Bowl champion Philadelphia Eagles and Dallas Cowboys on Peacock in...
05/09/2025
Warner Bros. Discovery filed a lawsuit against Gen AI company Midjourney this week, claiming that the company violated the studio's copyright....
05/09/2025
Alum Esin Ayd ng z Pens Nevermore Alma Mater' for Netflix's Wednesday The Grammy-nominated composer appears on this season's soundtrack alongside...
05/09/2025
C-SPAN and YouTube this week announced an agreement in which YouTube will sponsor C-SPAN's America 250 programming and expand access to C-SPAN's politic...
05/09/2025
LONDON Live broadcast infrastructure solutions provider Techex has added Peter Dawidzik as senior director, sales and business development....
05/09/2025
As news organizations and broadcasters face more pressure than ever to capture, verify, and distribute real-time access to breaking news, Reuters and Amazon Web...
05/09/2025
LOS ANGELES The Hollywood Professional Association (HPA) today said it has begun accepting submissions for its expanded HPA Engineering Excellence Awards, which...
05/09/2025
Sage has released an update to its most recent firmware for its Digital ENDEC model 3644....
05/09/2025
LONDON and LOS ANGELES ThinkAnalytics will debut ThinkMetadataAI, the company's latest step in recommendations, search and discovery services....
05/09/2025
WASHINGTON The Federal Communications Commissions Media Bureau has announced that the agency is initiating a phased process to lift the current freeze on major ...
05/09/2025
TORONTO Quickplay will feature its newly unveiled AI Studio that assists broadcasters and streamers in transforming their content libraries into short-form asse...
05/09/2025
Boston Conservatory at Berklee Announces Five-Year Partnership with The Verdon F...
05/09/2025
Alum Esin Ayding z Pens Nevermore Alma Mater' for Netflix's Wednesday The Grammy-nominated composer appears on this season's soundtrack alongside...
05/09/2025
Rohde & Schwarz UK awarded Silver Award under Defence Employer Recognition Schem...
05/09/2025
Back to All News
Two Desperate Souls and One Desperate Choice: As You Stood By...
05/09/2025
Back to All News
Netflix Unveils New BAKI-DOU' Anime and First Look at BE...
05/09/2025
This weekend sees another feast in store for all sport fans as RT airs a jam-packed schedule of live, free-to-air sport.
The Amgen Irish Open continues all we...
04/09/2025
(Joel Edgerton and Felicity Jones appear in Train Dreams by Clint Bentley, an of...
04/09/2025
SBS kicks off a confident slate with the 2026 FIFA World Cup2026 , premium drama...
04/09/2025
Watch the Record-breaking Koori Knockout 2025 LIVE and EXCLUSIVE on NITV and SBS...
04/09/2025
WALTHAM, Mass. Zixi, a provider of video-delivery-over-IP technology, has named Sue Mitchell as director of account management, EMEA....
04/09/2025
NEW YORK In news that highlights the importance of the NFL and sports for the TV and streaming industry, NBCUniversal is reporting record revenue for its 20th s...
04/09/2025
OTTAWA Ross Video has acquired LAMA, a Dutch-based developer of advanced audio production software known for its innovative live mixing solutions. The acquisiti...
04/09/2025
CINCINNATI GatesAir will bring its 5G passthrough demo to IBC audiences once again as interest in the technology's value for broadcast-to-mobile delivery gr...
04/09/2025
BURY ST. EDMUNDS, England Autoscript and Autocue will showcase their newly launched PTZ prompter system during IBC2025, Sept. 12-15 at the RAI Amsterdam Convent...
04/09/2025
Watch the Benn Family Band Perform I Wont Give Up on Americas Got Talent Assistant Professor Loren Benn and her family gave an emotional live performance of t...