| # Blaze (Finetuned Variant) |
|
|
| This model is a finetuned version of the **BLAZE** model described in the paper: |
|
|
| **_BLAZE: Cross‑Language and Cross‑Project Bug Localization via Dynamic Chunking and Hard Example Learning_** |
| DOI: [10.5281/zenodo.15122980](https://doi.org/10.5281/zenodo.15122980) :contentReference[oaicite:1]{index=1} |
|
|
| --- |
|
|
| ## 📘 What’s Inside |
|
|
| A Transformer-based bug localization model fine-tuned on additional cross-project datasets to enhance its ability to pinpoint bugs in unseen codebases. |
|
|
| --- |
|
|
| ## 🧪 Fine-tuning Details |
|
|
| - **Starting point:** Pretrained Blaze model from Zenodo. |
| - **Enhancements:** Further trained on a curated dataset using dynamic chunking and hard-negative sampling — all detailed in the original manuscript and accompanying code release. |
|
|
| --- |
|
|
| ## 🔍 Intended Usage |
|
|
| - **Primary task:** Automatic identification of buggy code segments across languages and projects. |
| - **How to use:** Load the model and feed it code snippets to receive localized bug predictions. |
|
|
| --- |
|
|
| ## 📄 Citation |
|
|
| If you use this model, please cite: |
|
|
| > _BLAZE: Cross‑Language and Cross‑Project Bug Localization via Dynamic Chunking and Hard Example Learning_, available at DOI: 10.5281/zenodo.15122980 :contentReference[oaicite:2]{index=2} |
|
|
| --- |
|
|
| ## 📁 Contents of This Repo |
|
|
| - `config.json` |
| - `pytorch_model.bin` (or `tf_model.h5`) |
| - `tokenizer.json` (if applicable) |
| - `README.md` |
|
|
| --- |
|
|
| ## 🛠️ Setup |
|
|
| ```bash |
| pip install transformers huggingface_hub |
| |