Skip to content

Proviesec/nlp-website-string-miner-for-fuzzing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 

Repository files navigation

NLP Website string miner for fuzzing

Find all keywords for your subdomain or folder search with website scrapping and NLP - for FUZZ (Bug Bounty)

Proviesec logo Buy Me A Coffee

Todos

  • txt export
  • parameter export
  • words export
  • Only Parameters or only Words

Installation

  1. git clone https://github.com/Proviesec/nlp-website-string-miner-for-fuzzing.git
  2. cd nlp-website-string-miner-for-fuzzing
  3. pip install -r requirements.txt or pip3 install -r requirements.txt
  4. python -m textblob.download_corpora

Documentation / Tutorials

  • py .\name-miner-crawler.py url deep range export?
  • py .\name-miner-crawler.py https://google.com 1 3 1
  • Deep: says how many levels (links) should be followed. Deep 2 says: all links from the first page and all links present on the next pages are viewed.
  • image

Example:

Screen:

py -3 .\dir-name-crawler.py http://github.com
Output: {'longer', 'contributors', 'twitter', 'octopus', 'enhance', 'javascript', 'secrets', 'partners', 'discussions', 'ohmyzshohmyzsh', 'laptops', 'actions', 'tensorflow', 'dev', 'conflicts', 'support', 'emptybody', 'flow', 'month', 'zero', 'maintain', 'swift', 'repo', 'octocat', 'arm', 'press', 'collaboration', 'jobs', 'zsh', 'cat', 'browser', 'requests', 'ship', 'contribution', 'fortune', 'efficient', 'codespaces', 'react', 'linkedin', 'npm', 'fields', 'software', 'addstatusscreens', 'infrastructure', 'core', 'system', 'eslint', 'plans', 'delightful', 'mag', 'story', 'environment', 'record', 'builds', 'space', 'jump', 'operating', 'topics', 'keeps', 'github', 'atom', 'help', 'kuberneteskubernetes', 'services', 'experiment', 'confirmation', 'requestsconversations', 'jasonetcooctocatclassifier', 'accelerate', 'flutter', 'youtube', 'security', 'source', 'fix', 'program', 'anything', 'inclusion', 'customer', 'host', 'gh', 'desktop', 'automate', 'indexjs', 'secure', 'data', 'vault', 'ohmyzsh', 'company', 'rust', 'pull', 'propose', 'session', 'blog', 'productiongrade', 'sign', 'connect', 'integrations', 'impact', 'build', 'secret', 'voice', 'clicli', 'set', 'onthe', 'automation', 'organization', 'reviews', 'commits', 'stop', 'containers', 'rewind', 'githubthe', 'cloud', 'kubernetes', 'language', 'control', 'handle', 'search', 'rustlangrust', 'heart', 'companies', 'repository', 'privacy', 'depend', 'sync', 'remote', 'install', 'oauth', 'someone', 'paid', 'container', 'hashicorpterraform', 'vm', 'world', 'gatsbyjsgatsby', 'development', 'technologies', 'value', 'input', 'interfaces', 'details', 'sales', 'fast', 'service', 'choice', 'web', 'sponsors', 'tokens', 'puts', 'joshaber', 'macos', 'explore', 'repositories', 'classifier', 'terms', 'simple', 'questions', 'create', 'readme', 'minutes', 'production', 'tensorflowtensorflow', 'conversations', 'thanks', 'funding', 'passing', 'days', 'machine', 'management', 'scale', 'careers', 's', 'vulnerability', 'user', 'desire', 'apps', 'confirmations', 'building', 'tests', 'small', 'check', 'contributions', 'jasonetco', 'flutterflutter', 'homeassistantcore', 'risk', 'developer', 'declarative', 'merge', 'partner', 'branch', 'tab', 'keys', 'stars', 'number', 'developers', 'team', 'site', 'terraform', 'enables', 'gatsby', 'bugs', 'depends', 'top', 'installation', 'home', 'resources', 'enterprise', 'events', 'change', 'others', 'forum', 'ci', 'issues', 'current', 'appleswift', 'compare', 'share', 'review', 'community', 'organizations', 'steps', 'shop', 'vulnerabilities', 'stories', 'marketplace', 'lab', 'date', 'setup', 'cli', 'beautiful', 'product', 'matrix', 'identifies', 'everything', 'level', 'want', 'git', 'map', 'facebookreact', 'guides', 'window', 'api', 'pmarsceill', 'mobile', 'it', 'package', 'invalidate', 'browse', 'results', 'contact', 'size', 'ask', 'sophshep', 'fees', 'features', 'communitydriven', 'configuration', 'workflow', 'experiments', 'stickers', 'answer', 'reach', 'x', 'notify', 'roadmap', 'projects', 'refresh', 'environments', 'npmcli', 'blazing', 'found', 'library', 'changes', 'everyone', 'write', 'docs', 'packages', 'codebase', 'script', 'websites', 'place', 'photographs', 'project', 'public', 'work', 'token', 'manager', 'cover', 'inventions', 'matters', 'code', 'reload', 'squash', 'electron', 'teams', 'pr', 'player', 'platform', 'education', 'supports', 'suggestion', 'key', 'built', 'framework', 'profile', 'green', 'millions', 'octocatclassifier', 'status', 'windows', 'facebook', 'https', 'studio', 'starts', 'push', 'origin', 'request', 'checks', 'registry', 'collections', 'readmemd', 'octocats', 'contribute', 'fund'}


https://textblob.readthedocs.io/en/dev/install.html

Disclaimer: DONT BE A JERK!

Needless to mention, please use this tool very very carefully. The authors won't be responsible for any consequences.