• About Us
  • Contact Us
  • Advertise
  • Privacy Policy
  • Guest Post
No Result
View All Result
Digital Phablet
  • Home
  • NewsLatest
  • Technology
    • Education Tech
    • Home Tech
    • Office Tech
    • Fintech
    • Digital Marketing
  • Social Media
  • Gaming
  • Smartphones
  • AI
  • Reviews
  • Interesting
  • How To
  • Home
  • NewsLatest
  • Technology
    • Education Tech
    • Home Tech
    • Office Tech
    • Fintech
    • Digital Marketing
  • Social Media
  • Gaming
  • Smartphones
  • AI
  • Reviews
  • Interesting
  • How To
No Result
View All Result
Digital Phablet
No Result
View All Result

Home » AI Headphones Powered by Apple M2 Translate Multiple Speakers

AI Headphones Powered by Apple M2 Translate Multiple Speakers

Rukhsar Rehman by Rukhsar Rehman
May 10, 2025
in News
Reading Time: 3 mins read
A A
AI Headphones Powered by Apple M2 Translate Multiple Speakers
ADVERTISEMENT

Select Language:


Google’s Pixel Buds have long provided impressive real-time translation capabilities. Recently, companies like Timkettle have launched similar earbuds aimed at business users. However, all these devices could only translate one audio source at a time.

ADVERTISEMENT

Researchers at the University of Washington have created a groundbreaking set of AI-powered headphones that can translate multiple voices simultaneously. Imagine an expert linguist in a busy bar, effortlessly processing conversations in several languages all at once.

This innovation is known as Spatial Speech Translation and utilizes binaural headphones. These headphones replicate auditory experiences as humans naturally perceive sounds. To capture this effect, microphones are positioned on a dummy head, mimicking the distance between human ears.

The significance of this approach lies in its ability to help us not only hear sounds but also discern their direction. The ultimate aim is to craft a natural sound environment, delivering a stereo experience akin to attending a live concert, which is referred to as spatial listening in today’s terminology.

ADVERTISEMENT

Led by Professor Shyam Gollakota, the team has worked on various impressive projects, including apps that provide underwater GPS for smartwatches and brain implants that interact with electronic devices.

How does multi-speaker translation work?

“We’re capturing the uniqueness of each person’s voice along with their directional speech for the first time,” notes Gollakota, associated with the Paul G. Allen School of Computer Science & Engineering.

The system functions like radar, detecting the number of speakers in its vicinity and dynamically updating that count as individuals move in and out of range. Impressively, this process operates entirely on-device, ensuring privacy by not sending audio data to cloud servers.

Along with translating speech, the technology preserves the tonal qualities and loudness of each speaker’s voice. It also includes dynamic adjustments based on how a speaker moves throughout the space. Interestingly, Apple is reportedly developing a similar feature for AirPods that would allow for real-time audio translation.

How does it all come to life?

In its testing phases, the UW team assessed the translation features of the AI headphones in various indoor and outdoor settings. The system can process and output translated spoken audio in 2 to 4 seconds. Participants seemed to prefer a delay of around 3 to 4 seconds, but the team is actively enhancing the translation speed.

While initial tests have focused on translations involving Spanish, German, and French, the researchers aim to expand their capabilities to include additional languages. Their approach blends blind source separation, localization, real-time expressive translation, and binaural sound rendering into a seamless process.

ADVERTISEMENT

The team utilized a speech translation model capable of real-time operation on Apple’s M2 silicon, performing audio processing with Sony’s noise-cancelling WH-1000XM4 headphones paired with a Sonic Presence SP15C binaural USB microphone.

Moreover, the code for this proof-of-concept device is available for others to explore, enabling the scientific community and hobbyists to build on the groundwork laid by the UW team.

ChatGPT Add us on ChatGPT Perplexity AI Add us on Perplexity
Tags: AIApple M2Headphonesmultiple speakerstranslate
ADVERTISEMENT
Rukhsar Rehman

Rukhsar Rehman

A University of California alumna with a background in mass communication, she now resides in Singapore and covers tech with a global perspective.

Related Posts

ChatGPT May Get Parental Controls and Other AIs Might Follow
News

ChatGPT May Get Parental Controls and Other AIs Might Follow

August 28, 2025
Quizlet Announces Big AI Update for Back to School
News

Quizlet Announces Big AI Update for Back to School

August 28, 2025
UN forms expert panel to steer global AI governance
News

UN forms expert panel to steer global AI governance

August 27, 2025
The Hottest New ChatGPT Trend Is Morbid
News

Elon Musk Sues Apple Over Favoring ChatGPT

August 26, 2025
Next Post
IAF Official Confirms Pakistan Struck Indian Air Bases with Missiles

IAF Official Confirms Pakistan Struck Indian Air Bases with Missiles

  • About Us
  • Contact Us
  • Advertise
  • Privacy Policy
  • Guest Post

© 2025 Digital Phablet

No Result
View All Result
  • Home
  • News
  • Technology
    • Education Tech
    • Home Tech
    • Office Tech
    • Fintech
    • Digital Marketing
  • Social Media
  • Gaming
  • Smartphones

© 2025 Digital Phablet