Across cultures, countries, and communities, the stories we share bring us together. And more often than not, it is the voices of the speakers that lend as much weight to the stories as the narratives themselves. For more than 15 years, Spotify’s global platform has empowered creators of all walks to share their work with audiences around the world. At its core, this has been made possible through technology that’s leveraged the power of audio to overcome barriers to access, borders, and distance. But with recent advancements, we’ve been wondering: Are there more ways we can bridge the language gap so that these voices can be heard worldwide?
Today, we’re excited to pilot Voice Translation for podcasts, a groundbreaking feature powered by AI that translates podcasts into additional languages—all in the podcaster’s voice.
This Spotify-developed tool leverages the latest innovations—one of which is OpenAI’s newly released voice generation technology—to match the original speaker’s style, making for a more authentic listening experience that sounds more personal and natural than traditional dubbing. A podcast episode originally recorded in English can now be available in other languages while keeping the speaker’s distinctive speech characteristics.
As part of the pilot, we’ve worked closely with podcasters Dax Shepard, Monica Padman, Lex Fridman, Bill Simmons, and Steven Bartlett to generate AI-powered voice translations in other languages—including Spanish, French, and German—for a select number of catalog episodes and future episode releases. We’re also looking forward to including other shows, such as Dax Shepard’s eff won with DRS, The Rewatchables from The Ringer, and Trevor Noah’s new original podcast, which launches later this year.
“By matching the creator’s own voice, Voice Translation gives listeners around the world the power to discover and be inspired by new podcasters in a more authentic way than ever before,” says Ziad Sultan, VP of Personalization. “We believe that a thoughtful approach to AI can help build deeper connections between listeners and creators, a key component of Spotify’s mission to unlock the potential of human creativity.”
Voice-translated episodes from pilot creators will be available worldwide to Premium and Free users. We’re starting by releasing an initial bundle of translated episodes in Spanish, with French and German rolling out in the coming days and weeks:
- Lex Fridman Podcast – “Interview with Yuval Noah Harari”
- Armchair Expert – “Kristen Bell, by the grace of god, returns”
- The Diary of a CEO with Steven Bartlett – “Interview with Dr. Mindy Pelz”
We’ll start rolling these out to users on the Now Playing View of supported episodes starting today. Can’t wait and want to hear the episodes right away? Head to the dedicated Voice Translations Hub, which we’ll update with even more voice-translated episodes over the coming weeks and months.
Today is just the beginning. We’re excited to empower creators to bring their storytelling to more listeners across the world. The creator and audience feedback from the pilot will provide important insights for future expansion, iterations, and innovations. As the number of people (100M+) regularly listening to podcasts on Spotify continues to grow, we’ll continue exploring new ways to overcome barriers to storytelling.
Stay tuned to Spotify for Podcasters as we aim to expand access for more creators and languages.
Lex Fridman (pronounced: Freedman)
Research Scientist, MIT, 2015 - current (2023)
Laboratory for Information and Decision Systems (LIDS)
Research: Human-robot interaction and machine learning.
Hiring: I'm hiring
Teaching: deeplearning.mit.edu
Podcast: Lex Fridman Podcast
Sample Conversations: Elon Musk, Mark Zuckerberg, Sam Harris, Joe Rogan, Vitalik Buterin, Grimes, Dan Carlin, Roger Penrose, Jordan Peterson, Richard Dawkins, Liv Boeree, Leonard Susskind, David Fravor, Kanye West, Donald Hoffman, Rick Rubin, etc.
Connect with me @lexfridman on Twitter, LinkedIn, Instagram, Facebook, YouTube, Medium.
https://open.spotify.com/show/2MAi0BvDc6GTFvKFPXnkCL
[글로벌] 스포티파이, 생성 AI 기술로 팟캐스트 음성 다국어 번역 서비스
- 조민수 기자
- 승인 2023.09.26 15:01
[아이티데일리] 스웨덴의 음원 스트리밍 글로벌 서비스 기업 스포티파이(Spotify)가 인공지능(AI)을 사용하여 팟캐스트를 다른 언어로 번역하는 서비스를 시작한다고 CNBC, 포브스지 등이 보도했다. 생성 AI를 제품이나 서비스에 도입하는 움직임이 급속도로 확장되는 모양새다.
이 서비스는 챗GPT로 유명세를 타고 있는 오픈AI(OpenAI)와의 제휴로 이루어진다. 이로써 스포티파이도 생성 AI를 사용하는 글로벌 인터넷 서비스 회사로 이름을 올렸다.
스포티파이는 공식 발표에서 “팟캐스트로 출력되는 음성을 원 발표자의 목소리와 스타일에 맞는 다른 언어로 번역하는 ‘음성 번역’ 기능 파일럿 서비스를 출시한다”고 밝혔다.
http://www.itdaily.kr/news/articlePrint.html?idxno=217067
'XR, Extended Reality, AI' 카테고리의 다른 글
Unreal, Meta Human, 시작하기 1 (1) | 2023.10.19 |
---|---|
찾아보기, Tool, Tensor.art, 가입하고 실행 (0) | 2023.10.15 |
Article, Xcode, updatesLearn about important changes to Xcode. (0) | 2023.10.01 |
언리얼 엔진 설치하기, 언리얼 엔진 문서 (0) | 2023.10.01 |
Smart NPC (0) | 2023.10.01 |