this post was submitted on 24 Jul 2024
1 points (100.0% liked)

Technology

59651 readers
2763 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

DuckDuckGo, Bing, Mojeek, and other search engines are not returning full Reddit results any more.

you are viewing a single comment's thread
view the rest of the comments
[–] capital@lemmy.world 0 points 4 months ago (1 children)

You'd probably feel differently if it were your service. Should you be able to control who scrapes your sites or should that be all or nothing?

For the record, I fucking hate what the internet is becoming. I naively believed that even if shit got cordoned off into the walled gardens that are mobile phone apps, the web would remain as open as it was. This is a terrible sign of things to come.

[–] reddig33@lemmy.world 0 points 4 months ago* (last edited 4 months ago) (1 children)

No, I wouldn’t feel differently. In fact letting search engines scrape and point to your content is what leads people to your site. It’s free advertising. If you’re going to let one search engine in, you should let them all in. If you want to be public, be public. Otherwise put up a login firewall and go private.

[–] capital@lemmy.world 0 points 4 months ago

It's not just search engines. Lots of people on Mastodon were using robots.txt to block ChatGPT (and any other LLM company they knew of) from scraping their sites/blogs.

I disagree, to a point. I want to be able to control my services to the greatest extent possible, including picking who scrapes me.

On the other hand, orgs as large as Google doing this poses a real threat to how the internet works right now which I hate.