This repository contains the implementation of our paper "Auto-labelling of Bug Report using Natural Language Processing". Our paper introduces an NLP-based method using bug report attributes, leveraging a neural network for retrieval.

aryan-jadon/Duplicate-Bug-Detection-Using-NLP

Duplicate-Bug-Detection

About Project

This repository contains the implementation of our paper "Auto-labelling of Bug Report using Natural Language Processing".
Detecting duplicate bug reports in tracking systems saves debugging effort, but traditional solutions lack clear
ranking, which deters their use. Our paper introduces an NLP-based method that uses bug report attributes and
leverages a neural network for retrieval.
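
To make the idea of ranked retrieval concrete, here is a minimal sketch that ranks existing reports against a new one by similarity. It is an illustration only: it swaps in TF-IDF weighting with cosine similarity instead of the neural network described in the paper, and the function names, report IDs, and report texts are all hypothetical and not taken from this repository.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def rank_candidates(query, reports):
    """Return (report_id, score) pairs sorted by descending similarity.

    NOTE: TF-IDF + cosine similarity is a stand-in for illustration;
    it is not the neural retrieval model from the paper.
    """
    docs = {rid: Counter(tokenize(text)) for rid, text in reports.items()}
    n_docs = len(docs)

    # Document frequency: in how many reports each term appears.
    df = Counter()
    for counts in docs.values():
        df.update(counts.keys())

    def idf(term):
        # Smoothed inverse document frequency, always positive.
        return math.log((1 + n_docs) / (1 + df[term])) + 1.0

    def weigh(counts):
        return {term: count * idf(term) for term, count in counts.items()}

    def cosine(a, b):
        dot = sum(w * b.get(term, 0.0) for term, w in a.items())
        norm_a = math.sqrt(sum(w * w for w in a.values()))
        norm_b = math.sqrt(sum(w * w for w in b.values()))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

    query_vec = weigh(Counter(tokenize(query)))
    scored = [(rid, cosine(query_vec, weigh(c))) for rid, c in docs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical reports, used only to exercise the sketch.
reports = {
    "BUG-1": "App crashes when saving a file",
    "BUG-2": "Login button is misaligned on mobile",
    "BUG-3": "Crash on file save with large documents",
}
ranking = rank_candidates("Application crash while saving file", reports)
for report_id, score in ranking:
    print(f"{report_id}: {score:.3f}")
```

A ranked list like this lets a triager look at the most likely duplicates first rather than at an unordered set of matches.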

Cite Our Paper

@INPROCEEDINGS{10126470,
  author={Patil, Avinash and Jadon, Aryan},
  booktitle={2023 IEEE 8th International Conference for Convergence in Technology (I2CT)}, 
  title={Auto-labelling of Bug Report using Natural Language Processing}, 
  year={2023},
  volume={},
  number={},
  pages={1-7},
  doi={10.1109/I2CT57861.2023.10126470}}

Installation Instructions

Pre-requisites:

  • Make sure you have Python and pip installed.

Steps:

  1. Clone the repository (if you haven't already):

    git clone https://github.com/aryan-jadon/Duplicate-Bug-Detection-Using-NLP.git
    cd Duplicate-Bug-Detection-Using-NLP
  2. (Optional but recommended) Set up a virtual environment:

    python -m venv venv
    source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
  3. Install the required packages:

    pip install -r requirements.txt

After following these steps, all necessary libraries should be installed, and you're ready to execute the project.

Steps to Execute the Project

  1. Run this script first. It creates a mapping of bugs that the subsequent scripts use.

    python create_bugs_map.py
  2. After generating the bugs map, the next step is to split the data into training and testing datasets.

    python create_train_test.py
  3. Once the training and testing data are ready, run the final script, which is the project's main entry point.

    python main_file.py
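
The three steps above can also be chained from a single Python helper, sketched below. The script names come from the steps above; the helper functions `build_commands` and `run_pipeline` are hypothetical and not part of this repository. The sketch assumes the scripts take no command-line arguments and are run from the repository root.

```python
import subprocess
import sys
from pathlib import Path

# Script names from the steps above, in the order they must run.
PIPELINE = ["create_bugs_map.py", "create_train_test.py", "main_file.py"]


def build_commands(scripts=PIPELINE, python=sys.executable):
    """Build one command per script, using the current Python interpreter."""
    return [[python, script] for script in scripts]


def run_pipeline(repo_dir=".", scripts=PIPELINE):
    """Run each script in order and stop at the first failure."""
    repo = Path(repo_dir)
    missing = [s for s in scripts if not (repo / s).is_file()]
    if missing:
        raise FileNotFoundError(f"Scripts not found in {repo}: {missing}")
    for command in build_commands(scripts):
        print("Running:", " ".join(command))
        # check=True raises CalledProcessError if a script exits non-zero,
        # so later steps never run on incomplete output.
        subprocess.run(command, cwd=repo, check=True)


# Example usage, from the repository root:
# run_pipeline(".")
```

Using `sys.executable` keeps every step inside the active virtual environment, so the packages installed from `requirements.txt` are the ones the scripts see.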
