Source Code Analysis - a rainmana Collection

CVEfixes: Automated Collection of Vulnerabilities and Their Fixes from Open-Source Software

Paper • 2107.08760 • Published Jul 19, 2021

From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future

Paper • 2408.02479 • Published Aug 5, 2024

Enhancing Large Language Models for Secure Code Generation: A Dataset-driven Study on Vulnerability Mitigation

Paper • 2310.16263 • Published Oct 25, 2023

Vulnerability Detection with Code Language Models: How Far Are We?

Paper • 2403.18624 • Published Mar 27, 2024

Automated Code-centric Software Vulnerability Assessment: How Far Are We? An Empirical Study in C/C++

Paper • 2407.17053 • Published Jul 24, 2024

Efficient Avoidance of Vulnerabilities in Auto-completed Smart Contract Code Using Vulnerability-constrained Decoding

Paper • 2309.09826 • Published Sep 18, 2023

A Vulnerability Code Intent Summary Dataset

Paper • 2504.08180 • Published Apr 11, 2025

Code Security Vulnerability Repair Using Reinforcement Learning with Large Language Models

Paper • 2401.07031 • Published Jan 13, 2024

A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly

Paper • 2312.02003 • Published Dec 4, 2023

A Systematic Study of Code Obfuscation Against LLM-based Vulnerability Detection

Paper • 2512.16538 • Published 21 days ago

White-Basilisk: A Hybrid Model for Code Vulnerability Detection

Paper • 2507.08540 • Published Jul 11, 2025 • 1

VISION: Robust and Interpretable Code Vulnerability Detection Leveraging Counterfactual Augmentation

Paper • 2508.18933 • Published Aug 26, 2025

LLM-Powered Code Vulnerability Repair with Reinforcement Learning and Semantic Reward

Paper • 2401.03374 • Published Jan 7, 2024

Code Structure-Aware through Line-level Semantic Learning for Code Vulnerability Detection

Paper • 2407.18877 • Published Jul 26, 2024

DeepCode: Open Agentic Coding

Paper • 2512.07921 • Published about 1 month ago • 31

CodeQA: A Question Answering Dataset for Source Code Comprehension

Paper • 2109.08365 • Published Sep 17, 2021

PyRadar: Towards Automatically Retrieving and Validating Source Code Repository Information for PyPI Packages

Paper • 2404.16565 • Published Apr 25, 2024

Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation

Paper • 2412.16135 • Published Dec 20, 2024

DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection

Paper • 2304.00409 • Published Apr 1, 2023 • 1

Malicious Source Code Detection Using Transformer

Paper • 2209.07957 • Published Sep 16, 2022

STraceBERT: Source Code Retrieval using Semantic Application Traces

Paper • 2312.04731 • Published Dec 7, 2023

Exploiting Novel GPT-4 APIs

Paper • 2312.14302 • Published Dec 21, 2023 • 14

CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

Paper • 2404.13161 • Published Apr 19, 2024

Comparing Human and LLM Generated Code: The Jury is Still Out!

Paper • 2501.16857 • Published Jan 28, 2025 • 1

Benchmarking Large Language Models for Multi-Language Software Vulnerability Detection

Paper • 2503.01449 • Published Mar 3, 2025 • 4

Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets

Paper • 2501.02628 • Published Jan 5, 2025

Poisoning Programs by Un-Repairing Code: Security Concerns of AI-generated Code

Paper • 2403.06675 • Published Mar 11, 2024

Multi-Agent Penetration Testing AI for the Web

Paper • 2508.20816 • Published Aug 28, 2025

Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation

Paper • 2308.10335 • Published Aug 20, 2023

BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems

Paper • 2505.15216 • Published May 21, 2025

Assessing the Quality and Security of AI-Generated Code: A Quantitative Analysis

Paper • 2508.14727 • Published Aug 20, 2025

Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis

Paper • 2412.14841 • Published Dec 19, 2024

Running in CIRCLE? A Simple Benchmark for LLM Code Interpreter Security

Paper • 2507.19399 • Published Jul 25, 2025 • 1

An Empirical Study of Vulnerabilities in Python Packages and Their Detection

Paper • 2509.04260 • Published Sep 4, 2025

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

Paper • 2410.11096 • Published Oct 14, 2024 • 13

Generate and Pray: Using SALLMS to Evaluate the Security of LLM Generated Code

Paper • 2311.00889 • Published Nov 1, 2023

CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation

Paper • 2501.08200 • Published Jan 14, 2025 • 1

ARVO: Atlas of Reproducible Vulnerabilities for Open Source Software

Paper • 2408.02153 • Published Aug 4, 2024

Demystifying RCE Vulnerabilities in LLM-Integrated Apps

Paper • 2309.02926 • Published Sep 6, 2023

ReCode: Robustness Evaluation of Code Generation Models

Paper • 2212.10264 • Published Dec 20, 2022 • 1

MOCHA: Are Code Language Models Robust Against Multi-Turn Malicious Coding Prompts?

Paper • 2507.19598 • Published Jul 25, 2025

The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMs

Paper • 2504.11711 • Published Apr 16, 2025

IRIS: LLM-Assisted Static Analysis for Detecting Security Vulnerabilities

Paper • 2405.17238 • Published May 27, 2024

QLCoder: A Query Synthesizer For Static Analysis of Security Vulnerabilities

Paper • 2511.08462 • Published Nov 11, 2025

Security Weaknesses of Copilot Generated Code in GitHub

Paper • 2310.02059 • Published Oct 3, 2023

CodeFort: Robust Training for Code Generation Models

Paper • 2405.01567 • Published Apr 11, 2024

Understanding the Effectiveness of Large Language Models in Detecting Security Vulnerabilities

Paper • 2311.16169 • Published Nov 16, 2023 • 1

PATCHEVAL: A New Benchmark for Evaluating LLMs on Patching Real-World Vulnerabilities

Paper • 2511.11019 • Published Nov 14, 2025 • 1

RedCode: Risky Code Execution and Generation Benchmark for Code Agents

Paper • 2411.07781 • Published Nov 12, 2024 • 1

Can Large Language Models Find And Fix Vulnerable Software?

Paper • 2308.10345 • Published Aug 20, 2023

Deep Learning based Vulnerability Detection: Are We There Yet?

Paper • 2009.07235 • Published Sep 3, 2020

On the Adversarial Robustness of Instruction-Tuned Large Language Models for Code

Paper • 2411.19508 • Published Nov 29, 2024

Human-Written vs. AI-Generated Code: A Large-Scale Study of Defects, Vulnerabilities, and Complexity

Paper • 2508.21634 • Published Aug 29, 2025

CodeAttack: Code-Based Adversarial Attacks for Pre-trained Programming Language Models

Paper • 2206.00052 • Published May 31, 2022 • 1

Shellcode_IA32: A Dataset for Automatic Shellcode Generation

Paper • 2104.13100 • Published Apr 27, 2021

A ground-truth dataset of real security patches

Paper • 2110.09635 • Published Oct 18, 2021

MetaReflection: Learning Instructions for Language Agents using Past Reflections

Paper • 2405.13009 • Published May 13, 2024

SecureBERT 2.0: Advanced Language Model for Cybersecurity Intelligence

Paper • 2510.00240 • Published Sep 30, 2025 • 1

Symbol Preference Aware Generative Models for Recovering Variable Names from Stripped Binary

Paper • 2306.02546 • Published Jun 5, 2023 • 1

A Repository-Level Dataset For Detecting, Classifying and Repairing Software Vulnerabilities

Paper • 2401.13169 • Published Jan 24, 2024

SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks

Paper • 2506.11791 • Published Jun 13, 2025

CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks

Paper • 2507.05269 • Published Jul 3, 2025 • 1

RedCoder: Automated Multi-Turn Red Teaming for Code LLMs

Paper • 2507.22063 • Published Jun 25, 2025 • 2

Cross-Domain Evaluation of Transformer-Based Vulnerability Detection on Open & Industry Data

Paper • 2509.09313 • Published Sep 11, 2025 • 2

How Far Have We Gone in Stripped Binary Code Understanding Using Large Language Models

Paper • 2404.09836 • Published Apr 15, 2024

Agent That Debugs: Dynamic State-Guided Vulnerability Repair

Paper • 2504.07634 • Published Apr 10, 2025

AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI)

Paper • 2506.08885 • Published Jun 10, 2025

Revisiting Pre-trained Language Models for Vulnerability Detection

Paper • 2507.16887 • Published Jul 22, 2025 • 1

Leveraging multi-task learning to improve the detection of SATD and vulnerability

Paper • 2501.15934 • Published Jan 27, 2025 • 2

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

Paper • 2509.13755 • Published Sep 17, 2025 • 19

VulDeePecker: A Deep Learning-Based System for Vulnerability Detection

Paper • 1801.01681 • Published Jan 5, 2018

Is Your AI-Generated Code Really Safe? Evaluating Large Language Models on Secure Code Generation with CodeSecEval

Paper • 2407.02395 • Published Jul 2, 2024

Automating the Detection of Code Vulnerabilities by Analyzing GitHub Issues

Paper • 2501.05258 • Published Jan 9, 2025

TRACED: Execution-aware Pre-training for Source Code

Paper • 2306.07487 • Published Jun 13, 2023 • 1

Devign: Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks

Paper • 1909.03496 • Published Sep 8, 2019

An Exploratory Study on Fine-Tuning Large Language Models for Secure Code Generation

Paper • 2408.09078 • Published Aug 17, 2024

VulSolver: Vulnerability Detection via LLM-Driven Constraint Solving

Paper • 2509.00882 • Published Aug 31, 2025

ProSec: Fortifying Code LLMs with Proactive Security Alignment

Paper • 2411.12882 • Published Nov 19, 2024 • 2

SecureCode v2.0: A Production-Grade Dataset for Training Security-Aware Code Generation Models

Paper • 2512.18542 • Published 19 days ago • 2

Reasoning with LLMs for Zero-Shot Vulnerability Detection

Paper • 2503.17885 • Published Mar 22, 2025

VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection

Paper • 2512.07533 • Published about 1 month ago • 2

Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models

Paper • 2103.15543 • Published Mar 29, 2021

Learning to Quantize Vulnerability Patterns and Match to Locate Statement-Level Vulnerabilities

Paper • 2306.06109 • Published May 26, 2023

Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives

Paper • 2310.01152 • Published Oct 2, 2023