Iterate endpoints from robots.txt (#471)

* Workflow to extract endpoints from robots.txt - requires projectdiscovery/nuclei/issues/304 fixed to work

Signed-off-by: Casper Guldbech Nielsen <whopsec@protonmail.com>

* Correcting path after conducted testing

Signed-off-by: Casper Guldbech Nielsen <whopsec@protonmail.com>

* Linting fixes

Signed-off-by: Casper Guldbech Nielsen <whopsec@protonmail.com>

* Update of regex after changes to nuclei regex params in extractor

Signed-off-by: Casper Guldbech Nielsen <whopsec@protonmail.com>

* Add detection of SEEEMS CMS

Signed-off-by: Casper Guldbech Nielsen <whopsec@protonmail.com>

* Updated template to support "iterate-all: true"

Co-authored-by: sandeep <sandeep@projectdiscovery.io>
patch-1
Casper Nielsen 2022-05-20 11:43:00 +02:00 committed by GitHub
parent 737026328b
commit 3c279049c2
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed files with 26 additions and 0 deletions

View File

@ -0,0 +1,26 @@
id: robots-txt-endpoint
info:
name: robots.txt endpoint prober
author: CasperGN,pdteam
severity: info
requests:
- method: GET
path:
- "{{RootURL}}/robots.txt"
- "{{RootURL}}{{endpoint}}"
extractors:
- type: regex
name: endpoint
group: 1
regex:
- '(?m:\s(/[[:alpha:]]+[[:graph:]]+))'
internal: true
iterate-all: true
matchers:
- type: status
status:
- 200