With Microsoft retiring the Bing Search APIs in August, legacy search engines have abandoned the developers who need real-time access to information.
We're stepping in to provide a search API designed for the new retrieval paradigms introduced by frontier AI systems.
We built Perplexity Search API around three criteria:
The system processes ~200M daily queries using distributed crawling/indexing, multi-stage ranking, and dynamic parsing.
We use AI to dynamically parse websites, continuously refining how it extracts and segments high-quality, meaningful content.
LLMs drive a self-improvement loop, balancing completeness and quality to keep the index fresh and accurately divided into spans for precise retrieval.
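To make the ideas of span segmentation and multi-stage ranking concrete, here is a minimal toy sketch. It is not Perplexity's implementation: the function names, the sentence-packing rule, and the term-overlap scoring are all assumptions chosen for illustration, standing in for the LLM-driven parsing and learned rankers described above.

```python
import re


def split_into_spans(text: str, max_chars: int = 200) -> list[str]:
    """Greedily pack whole sentences into spans of roughly max_chars.

    Hypothetical rule for illustration: a single sentence longer than
    max_chars is kept whole rather than cut mid-sentence.
    """
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    spans: list[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow the span, so close it.
            spans.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        spans.append(current)
    return spans


def _terms(text: str) -> list[str]:
    return re.findall(r"\w+", text.lower())


def rank_spans(query: str, spans: list[str], shortlist: int = 3) -> list[str]:
    """Toy two-stage ranking over spans (assumed scoring, not a real ranker)."""
    query_terms = set(_terms(query))

    def overlap(span: str) -> int:
        return len(query_terms & set(_terms(span)))

    def density(span: str) -> float:
        terms = _terms(span)
        return overlap(span) / len(terms) if terms else 0.0

    # Stage 1: cheap filter keeps the spans sharing the most query terms.
    stage1 = sorted(spans, key=overlap, reverse=True)[:shortlist]
    # Stage 2: finer score on the shortlist only (overlap per term in the span).
    return sorted(stage1, key=density, reverse=True)


if __name__ == "__main__":
    page = "Cats sleep a lot. Dogs bark loudly. Birds sing at dawn."
    spans = split_into_spans(page, max_chars=20)
    print(rank_spans("dogs bark", spans, shortlist=2))
```

The point of the two stages is that the cheap first pass bounds how many spans the more expensive second pass has to score; in a production system the second stage would typically be a learned model rather than a term-density heuristic.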
After releasing the Perplexity Search API and SDK, we developed a simple, neutral evaluation framework to benchmark search APIs as used by AI agents.
We compared our API against other providers and found state-of-the-art results on both quality and latency, removing the tradeoff between speed and accuracy.
Perplexity stands as both the fastest and highest-quality API on the market.
We deliver a median latency of 358ms—over 150ms faster than the next-best provider—while keeping 95th-percentile latency under 800ms.
Perplexity Search API achieves leading quality across single-step and deep research benchmarks, consistently outperforming competitors.
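For readers who want to compute comparable figures for their own workloads, the median and 95th-percentile latency quoted above can be derived from raw request timings roughly as follows. This is a minimal sketch: the helper names are hypothetical, the sample timings are made up, and the nearest-rank percentile method is an assumption, since the text does not say which percentile definition the benchmark uses.

```python
import math
import statistics


def percentile_nearest_rank(samples, pct):
    """Return the pct-th percentile using the nearest-rank method."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Nearest rank: smallest value with at least pct% of samples at or below it.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


def summarize_latency(samples_ms):
    """Summarize request latencies (milliseconds) as median and p95."""
    return {
        "p50_ms": statistics.median(samples_ms),
        "p95_ms": percentile_nearest_rank(samples_ms, 95),
    }


if __name__ == "__main__":
    # Made-up timings for illustration only, not measured data.
    timings_ms = [300, 320, 340, 358, 360, 380, 400, 420, 500, 790]
    print(summarize_latency(timings_ms))
```

Reporting a tail percentile alongside the median matters because a low median can hide slow outliers that dominate the experience of multi-step agents issuing many searches per task.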
Voice Assistant uses web browsing and multi-app actions to book reservations, send emails and calendar invites, play media, and more—all from the Perplexity iOS app.
Update your app in the App Store and start asking today.
Perplexity Voice Assistant can search for and play podcasts, YouTube videos, and other media.
Need to look up and reschedule meetings in your calendar? Voice Assistant can help you find events and draft emails.
Perplexity's Sonar, built on Llama 3.3 70B, outperforms GPT-4o-mini and Claude 3.5 Haiku while matching or surpassing top models like GPT-4o and Claude 3.5 Sonnet in user satisfaction.
At 1200 tokens/second, Sonar is optimized for answer quality and speed.
Sonar significantly outperforms GPT-4o-mini and Claude 3.5 Haiku in user satisfaction.
It also surpasses Claude 3.5 Sonnet and nearly matches GPT-4o, doing so at a fraction of the cost and over 10x faster.
Powered by Cerebras inference infrastructure, Sonar delivers answers at blazing-fast speeds, achieving a decoding throughput nearly 10x faster than comparable models like Gemini 2.0 Flash.
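To put the throughput figure in perspective, a short back-of-the-envelope calculation shows what 1200 tokens/second means for answer time. This is an illustrative sketch only: the 600-token answer length and the baseline at one tenth of Sonar's throughput are assumptions for the arithmetic, not measured values, and the estimate ignores search time and time to first token.

```python
def decode_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Time to stream num_tokens at a steady decoding throughput."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


SONAR_TPS = 1200               # throughput stated in the text
BASELINE_TPS = SONAR_TPS / 10  # hypothetical baseline, one tenth of Sonar
ANSWER_TOKENS = 600            # hypothetical answer length

if __name__ == "__main__":
    print(f"Sonar:    {decode_seconds(ANSWER_TOKENS, SONAR_TPS):.1f} s")
    print(f"Baseline: {decode_seconds(ANSWER_TOKENS, BASELINE_TPS):.1f} s")
```

Under these assumptions the same answer streams in half a second instead of five, which is the difference between a response that feels instant and one the user waits on.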
Assistant uses reasoning, search, and apps to help with daily tasks ranging from simple questions to multi-app actions. You can book dinner, find a forgotten song, call a ride, draft emails, set reminders, and more.
Available on the Play Store.
Perplexity Assistant browses the web to complete tasks for you. For example, if you want to be reminded of a public event, it will find the correct time and date and set an intelligent reminder.
It also maintains context from one action to another — if you’re researching restaurants in your area and want to reserve a table, choose an option and Assistant will help book it.