Katana VS BBOT Web Spidering #1266
Replies: 2 comments 7 replies
-
Yes, you will not see links like |
Beta Was this translation helpful? Give feedback.
-
Thanks for testing this btw. Our goal is to be the best web spidering tool, but most likely there are still a few improvements we need to make before it's fully up to speed with katana. Most of these will involve better URL extraction logic. Something to note is that right now BBOT strips off the query string (everything after the |
Beta Was this translation helpful? Give feedback.
-
I'm comparing these two tools for web spidering.
I used these two commands:
katana -u https://www.myaccounting.it/ -o katana.txt
and pass results fromhttpx -l katana.txt -o httpx.txt -mc 200
bbot -t https://www.myaccounting.it/ -m httpx -om asset_inventory -c web_spider_distance=4 web_spider_depth=4 web_spider_links_per_page=1000 omit_event_types=["DNS_NAME_UNRESOLVED","URL_UNVERIFIED"] url_extension_httpx_only=[]
I only considered links from https://www.myaccounting.it/ . There are many links exist in httpx result, but not in bbot, A lot from links cointain /wp-json/ and embed. I got diff and attached the links here. I also removed blacklist extensions such as .css .
Is this because of my settings for depth?
httpx.txt
bbot.txt
diff.txt
Beta Was this translation helpful? Give feedback.
All reactions