Robert R George
a98a8ee93f
Update robots.txt to prevent crawling of domain blocks ( #26470 )
...
Co-authored-by: Claire <claire.github-309c@sitedethib.com>
2024-12-02 08:03:24 +00:00
Foritus
405f141fe0
Change: Block GPTBot ( #26396 )
2023-08-09 11:58:46 +02:00
ThibG
c4f2433300
Disallow robots from indexing /interact/ ( #10666 )
...
This does not provide any new information and may just triple the number
of crawled pages
2019-05-02 00:10:19 +02:00
nightpool
a5992e5883
Change robots.txt to exclude only media proxy URLs ( #10038 )
...
* Revert "Change robots.txt to exclude some URLs (#10037 )"
This reverts commit 80161f4351
.
* Let's block media_proxy
/media_proxy/ is a dynamic route used for requesting uncached media, so it's
probably bad to let crawlers use it
* misleading comment
2019-02-14 03:11:47 +01:00
Eugen Rochko
80161f4351
Change robots.txt to exclude some URLs ( #10037 )
...
- Exclude static assets
- Exclude uploaded files
- Exclude alternate versions of the profile page
- Exclude media proxy URLs
2019-02-13 21:28:18 +01:00
Eugen Rochko
9c4856bdb1
Initial commit
2016-02-20 22:53:20 +01:00