This Python script helps automate content discovery on a target domain by fetching and parsing robots.txt
and sitemap.xml
files, checking URL statuses, and saving results. It is useful for reconnaissance in web security assessments.
robots.txt
to find disallowed paths.sitemap.xml
to extract site URLs.discovered_content.txt
for later analysis.Happy content hunting! šµļøāāļøš