Close Menu
Voxa News

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    'Contradictions in objectives' of Chinese Studies Dept & what China would like UK schools to project

    August 5, 2025

    Childcare costs push families out of work and into poverty

    August 5, 2025

    14 Best Beauty Box Subscriptions, Tested for Months (2025)

    August 5, 2025
    Facebook X (Twitter) Instagram
    Voxa News
    Trending
    • 'Contradictions in objectives' of Chinese Studies Dept & what China would like UK schools to project
    • Childcare costs push families out of work and into poverty
    • 14 Best Beauty Box Subscriptions, Tested for Months (2025)
    • David Squires on … his boxing forebear who died on the Titanic
    • More disclosure of suspects’ immigration status needed, Cooper says
    • Chipmaker TSMC says it has discovered potential trade secret leaks
    • Palestine: Peace de Resistance review – an absurdist response to an abominable situation | Edinburgh festival 2025
    • Anne Sofie Madsen Copenhagen Spring 2026
    Tuesday, August 5
    • Home
    • Business
    • Health
    • Lifestyle
    • Politics
    • Science
    • Sports
    • Travel
    • World
    • Entertainment
    • Technology
    Voxa News
    Home»Technology»Perplexity is allegedly scraping websites it’s not supposed to, again
    Technology

    Perplexity is allegedly scraping websites it’s not supposed to, again

    By Olivia CarterAugust 5, 2025No Comments2 Mins Read0 Views
    Facebook Twitter Pinterest LinkedIn Telegram Tumblr Email
    Perplexity is allegedly scraping websites it's not supposed to, again
    REUTERS / Reuters
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Web crawlers deployed by Perplexity to scrape websites are allegedly skirting restrictions, according to a new report from Cloudflare. Specifically, the report claims that the company’s bots appear to be “stealth crawling” sites by disguising their identity to get around robots.txt files and firewalls.

    Robots.txt is a simple file websites host that lets web crawlers know if they can scrape a websites’ content or not. Perplexity’s official web crawling bots are “PerplexityBot” and “Perplexity-User.” In Cloudflare’s tests, Perplexity was still able to display the content of a new, unindexed website, even when those specific bots were blocked by robots.txt. The behavior extended to websites with specific Web Application Firewall (WAF) rules that restricted web crawlers, as well.

    Cloudflare

    Cloudflare believes that Perplexity is getting around those obstacles by using “a generic browser intended to impersonate Google Chrome on macOS” when robots.txt prohibits its normal bots. In Cloudlfare’s tests, the company’s undeclared crawler could also rotate through IP addresses not listed in Perplexity’s official IP range to get through firewalls. Cloudflare says that Perplexity appears to be doing the same thing with autonomous system numbers (ASNs) — an identifier for IP addresses operated by the same business — writing that it spotted the crawler switching ASNs “across tens of thousands of domains and millions of requests per day.”

    Engadget has reached out to Perplexity for comment on Cloudflare’s report. We’ll update this article if we hear back.

    Up-to-date information from websites is vital to companies training AI models, especially as service’s like Perplexity are used as replacements for search engines. Perplexity has also been caught in the past circumventing the rules to stay up-to-date. Multiple websites reported in 2024 that Perplexity was still accessing their content despite them forbidding it in robots.txt — something the company blamed on the third-party web crawlers it was using at the time. Perplexity later partnered with multiple publishers to share revenue earned from ads displayed alongside their content, seemingly as a make-good for its past behavior.

    Stopping companies from scraping content from the web will likely remain a game of whack-a-mole. In the meantime, Cloudflare has removed Perplexity’s bots from its list of verified bots and implemented a way to identify and block Perplexity’s stealth crawler from accessing its customers’ content.

    allegedly Perplexity scraping supposed websites
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Olivia Carter
    • Website

    Olivia Carter is a staff writer at Verda Post, covering human interest stories, lifestyle features, and community news. Her storytelling captures the voices and issues that shape everyday life.

    Related Posts

    14 Best Beauty Box Subscriptions, Tested for Months (2025)

    August 5, 2025

    Chipmaker TSMC says it has discovered potential trade secret leaks

    August 5, 2025

    Can’t Look Away review – a harrowing, heartbreaking indictment of social media’s ruthlessness | Film

    August 5, 2025

    Developers go their own way as jobs dry up

    August 5, 2025

    A top designer was banned from Dribbble. Now he’s building his own competitor.

    August 5, 2025

    HelloFresh Coupon Codes: 55% Off + Free Meals – August 2025

    August 5, 2025
    Leave A Reply Cancel Reply

    Medium Rectangle Ad
    Top Posts

    27 NFL draft picks remain unsigned, including 26 second-rounders and Bengals’ Shemar Stewart

    July 17, 20251 Views

    Eight healthy babies born after IVF using DNA from three people | Science

    July 17, 20251 Views

    Massive Attack announce alliance of musicians speaking out over Gaza | Kneecap

    July 17, 20251 Views
    Don't Miss

    'Contradictions in objectives' of Chinese Studies Dept & what China would like UK schools to project

    August 5, 2025

    A report by UK‑China Transparency (UKCT) has unveiled major challenges affecting students, scholars and institutions…

    Childcare costs push families out of work and into poverty

    August 5, 2025

    14 Best Beauty Box Subscriptions, Tested for Months (2025)

    August 5, 2025

    David Squires on … his boxing forebear who died on the Titanic

    August 5, 2025
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews
    Medium Rectangle Ad
    Most Popular

    27 NFL draft picks remain unsigned, including 26 second-rounders and Bengals’ Shemar Stewart

    July 17, 20251 Views

    Eight healthy babies born after IVF using DNA from three people | Science

    July 17, 20251 Views

    Massive Attack announce alliance of musicians speaking out over Gaza | Kneecap

    July 17, 20251 Views
    Our Picks

    As a carer, I’m not special – but sometimes I need to be reminded how important my role is | Natasha Sholl

    June 27, 2025

    Anna Wintour steps back as US Vogue’s editor-in-chief

    June 27, 2025

    Elon Musk reportedly fired a key Tesla executive following another month of flagging sales

    June 27, 2025
    Recent Posts
    • 'Contradictions in objectives' of Chinese Studies Dept & what China would like UK schools to project
    • Childcare costs push families out of work and into poverty
    • 14 Best Beauty Box Subscriptions, Tested for Months (2025)
    • David Squires on … his boxing forebear who died on the Titanic
    • More disclosure of suspects’ immigration status needed, Cooper says
    • About Us
    • Disclaimer
    • Get In Touch
    • Privacy Policy
    • Terms and Conditions
    2025 Voxa News. All rights reserved.

    Type above and press Enter to search. Press Esc to cancel.