Top 100 LLM Sources: BotRank Study on 1.2M AI Responses
Which sources do ChatGPT, Gemini and Perplexity use most? BotRank reveals the Top 100 LLM sources analysed across 1.2 million AI responses in 2026.
Summary: at BotRank.ai we analyse thousands of prompts daily for our clients - a goldmine of data for understanding how AI engines actually work. We analysed more than 1.2 million responses generated by ChatGPT, Gemini, Perplexity, Claude and others over the past few weeks. The finding is clear: LLMs rely massively on third-party authority sources. Reddit, YouTube and Wikipedia dominate the ranking, closely followed by review platforms like Trustpilot and mainstream media. Discover the full ranking of the 100 most-used sources and our tips for adapting your GEO strategy.
Understanding which LLM sources are favoured is the cornerstone of any effective GEO (Generative Engine Optimization) strategy. If you know which sites influence AI responses in your market, you know exactly where you need to be present to increase your AI share of voice.
We therefore conducted an unprecedented study: we dissected more than 1.2 million responses generated by the leading AI engines over the past few weeks. The goal? To extract, rank and analyse the Top 100 most-used sources by AI.
Using our tracking technology, we collected and analysed the source links provided by AI engines for the responses they generated. We then aggregated this data to identify the root domains most frequently called upon to build those responses.
Our sample of 1,200,000 analysed responses is particularly robust because it is not biased towards any specific industry. It reflects the real queries of BotRank users, covering a very broad spectrum:
Here is the exact breakdown of AI engines queried in our dataset:
This ranking is built exclusively from data collected for BotRank.ai clients. It reflects a cross-sector, multi-industry reality. Every brand operates in a specific market where the authoritative sources may differ significantly from this global ranking. Running a personalised analysis is the best way to accurately identify which sources truly influence LLMs in your market, your sector and your target queries.
The percentages shown in the table below represent the usage rate of each source within this Top 100. For example, if a source shows 10%, it means it accounts for 10% of all citations generated by the 100 most popular sites.
Here is the complete ranking of the 100 most influential domains in AI-generated responses in 2026.
Ranks 1 to 50
| Rank | Source (Domain) | Usage rate (Top 100) |
|---|---|---|
| 1 | reddit.com | 11.47% |
| 2 | youtube.com | 9.87% |
| 3 | wikipedia.org | 9.43% |
| 4 | trustpilot.com | 7.96% |
| 5 | linkedin.com | 2.85% |
| 6 | lefigaro.fr | 2.71% |
| 7 | lemonde.fr | 2.18% |
| 8 | appvizer.fr | 2.00% |
| 9 | techradar.com | 1.78% |
| 10 | indeed.com | 1.70% |
| 11 | lafabriquedunet.fr | 1.48% |
| 12 | leparisien.fr | 1.47% |
| 13 | facebook.com | 1.31% |
| 14 | alibaba.com | 1.27% |
| 15 | service-public.gouv.fr | 1.23% |
| 16 | quechoisir.org | 1.16% |
| 17 | lesnumeriques.com | 1.10% |
| 18 | europages.fr | 1.10% |
| 19 | forbes.com | 1.08% |
| 20 | instagram.com | 1.07% |
| 21 | bfmtv.com | 1.02% |
| 22 | nih.gov | 0.94% |
| 23 | clubic.com | 0.91% |
| 24 | pagesjaunes.fr | 0.90% |
| 25 | frandroid.com | 0.88% |
| 26 | sortlist.fr | 0.85% |
| 27 | hellowork.com | 0.83% |
| 28 | francetvinfo.fr | 0.80% |
| 29 | amazon.fr | 0.77% |
| 30 | leblogdumoderateur.com | 0.74% |
| 31 | numerama.com | 0.72% |
| 32 | ouest-france.fr | 0.70% |
| 33 | glassdoor.fr | 0.68% |
| 34 | journaldunet.com | 0.65% |
| 35 | lesechos.fr | 0.63% |
| 36 | capterra.com | 0.60% |
| 37 | g2.com | 0.58% |
| 38 | franceculture.fr | 0.56% |
| 39 | twitter.com | 0.54% |
| 40 | linternaute.com | 0.52% |
| 41 | fnac.com | 0.51% |
| 42 | franceinter.fr | 0.50% |
| 43 | leboncoin.fr | 0.49% |
| 44 | quora.com | 0.48% |
| 45 | seloger.com | 0.47% |
| 46 | shopify.com | 0.46% |
| 47 | francetvinfo.fr | 0.45% |
| 48 | statista.com | 0.44% |
| 49 | doctolib.fr | 0.43% |
| 50 | malt.fr | 0.43% |
| Rank | Source (Domain) | Usage rate (Top 100) |
|---|---|---|
| 51 | capterra.fr | 0.42 % |
| 52 | welcometothejungle.com | 0.42 % |
| 53 | 20minutes.fr | 0.40 % |
| 54 | tripadvisor.com | 0.40 % |
| 55 | impli.fr | 0.40 % |
| 56 | scribd.com | 0.40 % |
| 57 | amazon.com | 0.40 % |
| 58 | tripadvisor.fr | 0.40 % |
| 59 | cnews.fr | 0.40 % |
| 60 | trustfolio.co | 0.38 % |
| 61 | service-public.fr | 0.38 % |
| 62 | latribune.fr | 0.38 % |
| 63 | getapp.fr | 0.38 % |
| 64 | leprogres.fr | 0.37 % |
| 65 | adobe.com | 0.37 % |
| 66 | cci-paris-idf.fr | 0.36 % |
| 67 | shine.fr | 0.36 % |
| 68 | 01net.com | 0.36 % |
| 69 | cegos.fr | 0.35 % |
| 70 | medium.com | 0.35 % |
| 71 | connexionfrance.com | 0.34 % |
| 72 | cdiscount.com | 0.34 % |
| 73 | hubspot.fr | 0.34 % |
| 74 | challenges.fr | 0.32 % |
| 75 | petitfute.com | 0.32 % |
| 76 | futura-sciences.com | 0.31 % |
| 77 | arxiv.org | 0.31 % |
| 78 | sciencedirect.com | 0.31 % |
| 79 | etsy.com | 0.31 % |
| 80 | theguardian.com | 0.31 % |
| 81 | tiktok.com | 0.31 % |
| 82 | fastercapital.com | 0.30 % |
| 83 | marieclaire.fr | 0.30 % |
| 84 | boursorama.com | 0.29 % |
| 85 | europa.eu | 0.28 % |
| 86 | presse-citron.net | 0.28 % |
| 87 | marques-de-france.fr | 0.28 % |
| 88 | journaldesfemmes.fr | 0.27 % |
| 89 | digitiz.fr | 0.25 % |
| 90 | ladepeche.fr | 0.25 % |
| 91 | idealo.fr | 0.24 % |
| 92 | beta.gouv.fr | 0.23 % |
| 93 | businessinsider.com | 0.21 % |
| 94 | reuters.com | 0.21 % |
| 95 | journaldunet.com | 0.20 % |
| 96 | pappers.fr | 0.20 % |
| 97 | dailymotion.com | 0.19 % |
| 98 | ecologie.gouv.fr | 0.19 % |
| 99 | lesechos.fr | 0.19 % |
| 100 | kompass.com | 0.18 % |
Reading this Top 100 overturns several common assumptions about how AI actually works. Here are the three major trends that emerge from our study.
The Top 10 LLM sources are overwhelmingly dominated by community platforms and encyclopaedic references.
This is the standout finding of the study: Reddit (11.47%) and YouTube (9.87%) take the top two spots, ahead of the ever-present Wikipedia (9.43%). LLMs, particularly ChatGPT and Perplexity, place a massive premium on authentic user experience and community discussions. AI seeks to humanise its responses by drawing on real-world accounts rather than polished brand messaging.
The presence of Trustpilot (7.96%) in 4th place globally is a wake-up call for every brand. When a user asks an AI "What is the best tool for...", the model will systematically consult review platforms to build its recommendation. This trend runs throughout the ranking, with Appvizer (8th), Capterra (36th), G2 (37th) and Sortlist (26th) all featuring prominently. Third-party reputation is no longer optional - it is the engine of your AI visibility.
Journalistic authority continues to carry significant weight in LLM training and RAG pipelines. Le Figaro (6th) and Le Monde (7th) appear in the Top 10, followed by specialist media such as TechRadar (9th) and Le Blog du Modérateur (30th). Digital PR therefore plays a direct role in your GEO strategy: securing an explicit citation or a backlink from an authoritative media outlet is a major trust signal that LLMs immediately absorb when constructing their responses.
Now that you know where AI gets its information, how do you adapt your marketing strategy? Here are 4 actionable tips to boost your AI visibility rate.
The golden rule of GEO is omnipresence across authority sources. Do not limit yourself to optimising your own website. Use this ranking to identify the platforms most relevant to your sector (Reddit, Trustpilot, Capterra, specialist media) and make sure you are present, active and positively rated on each of them. This is what is known as off-page GEO optimisation, which often carries more weight than the technical optimisation of your own pages.
If you are a software publisher, your presence on G2, Capterra and Appvizer is vital. If you operate in B2C or services, Trustpilot and Google My Business should be your priority. Put automated campaigns in place to collect regular, detailed reviews. LLMs love synthesising customer verbatims to justify their recommendations.
Since Reddit is the world's number one source, you can no longer afford to ignore it. Be careful, though: the Reddit community dislikes aggressive self-promotion. The goal is to add genuine value, answer technical questions and subtly mention your solutions when it is relevant. This is a long-term effort that requires an authentic approach.
Even though third-party sources are crucial, your website remains the primary source of information about your own brand. Make sure AI bots can read you easily. Do not block crawlers in your robots.txt and use Schema.org markup. For guidance, consult our technical documentation on micro-data.
It is impossible to manually check every day what ChatGPT, Gemini or Perplexity are saying about you. That is where a platform like BotRank comes in. Our tool lets you:
Try BotRank now, for free!



