How A Tiny Polish Startup Became The Multi-Billion-Dollar Voice Of AI

By Forbes

Key Concepts

  • AI Text-to-Speech (TTS): Artificial intelligence technology that converts written text into spoken audio.
  • Voice Cloning: The process of using AI to create a synthetic voice that mimics the unique vocal characteristics of a specific individual.
  • Deepfake Swindles: Fraudulent schemes using AI-generated media, such as cloned voices, to impersonate individuals and deceive victims.
  • Lector: A single voice actor who reads all dialogue in dubbed films, particularly common in Poland, often in a monotone.
  • Valuation: The estimated worth of a company, in this case, 11 Labs' market value.
  • Profit Margin: The percentage of revenue that remains after all expenses have been deducted from sales.
  • Speech-to-Text: Technology that converts spoken language into written text.

The Genesis of 11 Labs: Solving a "Uniquely Polish Horror"

11 Labs, a Polish startup, was founded by Mateusz "Mati" Staniszewski and Piotr Dąbkowski to address the "horrible" quality of dubbed films in Poland. Staniszewski, a former Palantir employee, and Dąbkowski, a Google engineer, observed that Polish dubbed content typically features a single "lector" delivering all dialogue in an "unnerved Slavic monotone," with no variation between speakers. Staniszewski attributes the practice to a "communist thing that stuck as a cheap way to produce content." Young audiences reportedly "hate it."

The co-founders initially experimented with artificial intelligence projects, including an AI public speaking coach, and realized that the same technology could solve the dubbing problem by providing natural, varied voices. They pooled their savings and, by May 2022, had quit their jobs to work on 11 Labs full-time.

Technological Breakthrough and Initial Launch

11 Labs' new AI text-to-speech generator immediately surpassed existing solutions like Apple's Siri and Amazon's Alexa, which were characterized by "robotic voices." 11 Labs' AI voices demonstrated the capability to convey emotions such as "happiness, excitement, and even laughter."

In January 2023, 11 Labs launched its first model. This model could convert any text into spoken audio using AI, including cloning a user's own voice or, controversially, someone else's.

Rapid Adoption and Market Demand

The launch generated immediate demand across various sectors:

  • Authors: Utilized the software to instantly generate audiobooks. Pro rates for higher quality and more time start from $99 per month.
  • YouTube Creators: Employed 11 Labs to translate their videos into other languages, with models now supporting 29 languages.
  • Apps: The Warsaw- and London-based startup secured deals with language learning and meditation applications.
  • Media Companies: Major players like HarperCollins and Germany's Burda Group adopted the technology.

Jennifer Li of Andreessen Horowitz, which co-led a $19 million funding round in May 2023, noted, "It was obvious that this was the best model and everyone was picking it off the shelf." A year later, Staniszewski and Dąbkowski were recognized in Forbes' 30 Under 30 Europe list.

Funding, Valuation, and Financial Success

Despite ethical concerns about misuse of the technology, venture capitalists continued to invest heavily in 11 Labs. The company has raised "more than $300 million in all" and reached "a $6.6 billion valuation in October," establishing it as "one of Europe's most valuable startups."

Forbes estimates that Staniszewski (30, CEO) and Dąbkowski (30, Research Head) are now billionaires, each "worth just over $1 billion." Uniquely among AI firms, 11 Labs is profitable, netting an estimated "$116 million in the last 12 months," representing a "60% margin" on its "$193 million in trailing 12-month revenue."
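The quoted margin can be checked directly against the two figures above. The short Python sketch below does only that arithmetic; the function and variable names are illustrative, and the inputs are the Forbes estimates as quoted, not audited financials.

```python
# Figures as quoted in the summary (Forbes estimates, millions of USD).
net_profit_musd = 116
revenue_musd = 193


def profit_margin(profit: float, revenue: float) -> float:
    """Return profit as a percentage of revenue."""
    if revenue <= 0:
        raise ValueError("revenue must be positive")
    return 100.0 * profit / revenue


margin_pct = profit_margin(net_profit_musd, revenue_musd)
print(f"Margin: {margin_pct:.1f}%")  # prints "Margin: 60.1%"
```

The result rounds to the "60% margin" cited in the article.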

Diverse Revenue Streams and Corporate Partnerships

11 Labs' revenue is split almost evenly:

  • Corporate Clients (approximately 50%): Companies like Cisco, Twilio, and Swiss recruitment agency Adecco use the technology for customer service calls and interviewing job seekers. Epic Games leverages it to voice characters in Fortnite, including a chat with Darth Vader, with the consent of James Earl Jones's estate.
  • Early Adopters (approximately 50%): This segment includes YouTubers, podcasters, and authors who were among the first to adopt the platform.

Challenges and Ethical Concerns: The Dark Side of Voice Cloning

The advanced capabilities of 11 Labs' AI have also led to "unnerving uses." AI sound-alikes of public figures quickly went viral, including:

  • President Trump "crassly narrating video game duels."
  • Actress Emma Watson "reading Mein Kampf."
  • Podcaster Joe Rogan "touting scams."

More alarmingly, fraudsters began exploiting AI cloning tools to "impersonate loved ones' voices and steal millions in sophisticated deepfake swindles."

Competitive Landscape and 11 Labs' Edge

11 Labs is now competing against tech giants like Google, Microsoft, Amazon, and OpenAI to become the "de facto voice of AI." The space is not new: tech companies have been developing speech-related products for about a decade (e.g., Microsoft's $20 billion acquisition of Nuance in March 2022 and OpenAI's voice tool, launched in March 2024). Even so, 11 Labs, with its 300-person team (as of October 2024), is not "playing catch-up."

Its models are so superior that the company can charge "up to three times as much as these American rivals." 11 Labs boasts the "largest by far" library of "10,000 uncannily human-sounding voices," which now includes A-listers like Michael Caine and Matthew McConaughey. Staniszewski confidently states, "We are one of the very few companies that are ahead of OpenAI, not only on speech, but speech to text and music. That's hard."

Conclusion: The Voice of AI

11 Labs has rapidly transformed from a small Polish startup into a multi-billion dollar enterprise, driven by its superior AI text-to-speech technology that offers emotional range and voice cloning capabilities. By addressing a specific market need and continuously innovating, it has attracted significant investment, achieved profitability, and secured a dominant position in the competitive AI voice market. Despite the ethical challenges posed by deepfake misuse, 11 Labs continues to expand its offerings and client base, positioning itself as a leading force in shaping the future of AI-generated audio.