PurCL
diff --git a/‎README.md
Lines changed: 6 additions & 6 deletions b/‎README.md
Lines changed: 6 additions & 6 deletions
diff --git a/‎data/labeldata/labeldata.json
Lines changed: 63 additions & 0 deletions b/‎data/labeldata/labeldata.json
Lines changed: 63 additions & 0 deletions
diff --git a/‎data/papers/labels/bug_detection.md
Lines changed: 18 additions & 0 deletions b/‎data/papers/labels/bug_detection.md
Lines changed: 18 additions & 0 deletions
diff --git a/‎data/papers/labels/fuzzing.md
Lines changed: 6 additions & 0 deletions b/‎data/papers/labels/fuzzing.md
Lines changed: 6 additions & 0 deletions
@@ -32,7 +32,7 @@ We have systematically selected papers from the following venues, which are top-
 
 - Security (Sec)
   - [S&P2023](data/papers/venues/S&P2023/README.md), [USENIXSec2023](data/papers/venues/USENIXSec2023/README.md), [CCS2023](data/papers/venues/CCS2023/README.md), [NDSS2023](data/papers/venues/NDSS2023/README.md)
-  - [S&P2024](data/papers/venues/S&P2024/README.md), [NDSS2024](data/papers/venues/NDSS2024/README.md), [CCS2024](data/papers/venues/CCS2024/README.md)
+  - [S&P2024](data/papers/venues/S&P2024/README.md), [USENIXSec2024](data/papers/venues/USENIXSec2024/README.md), [NDSS2024](data/papers/venues/NDSS2024/README.md), [CCS2024](data/papers/venues/CCS2024/README.md)
 
 - Natural Language Processing (NLP)
   - [ACL2023](data/papers/venues/ACL2023/README.md), [EMNLP2023](data/papers/venues/EMNLP2023/README.md), [NAACL2023](data/papers/venues/NAACL2023/README.md)
@@ -71,9 +71,9 @@ This category focuses on typical tasks in Software Engineering (SE) and Programm
   - [Code Completion](data/papers/labels/code_completion.md)   (22)
   - [Program Repair](data/papers/labels/program_repair.md)   (41)
   - [Program Transformation](data/papers/labels/program_transformation.md)   (31)
-- [Program Testing](data/papers/labels/program_testing.md)   (54)
+- [Program Testing](data/papers/labels/program_testing.md)   (55)
   - [General Testing](data/papers/labels/general_testing.md)   (1)
-  - [Fuzzing](data/papers/labels/fuzzing.md)   (23)
+  - [Fuzzing](data/papers/labels/fuzzing.md)   (24)
   - [Library Testing](data/papers/labels/library_testing.md)   (1)
   - [DBMS Testing](data/papers/labels/DBMS_testing.md)   (1)
   - [Compiler Testing](data/papers/labels/compiler_testing.md)   (4)
@@ -84,16 +84,16 @@ This category focuses on typical tasks in Software Engineering (SE) and Programm
   - [Debugging](data/papers/labels/debugging.md)   (9)
   - [Bug Reproduction](data/papers/labels/bug_reproduction.md)   (2)
   - [Vulnerability Exploitation](data/papers/labels/vulnerability_exploitation.md)   (6)
-- [Static Analysis](data/papers/labels/static_analysis.md)   (133)
+- [Static Analysis](data/papers/labels/static_analysis.md)   (136)
   - [Syntactic Analysis](data/papers/labels/syntactic_analysis.md)   (1)
   - [Pointer Analysis](data/papers/labels/pointer_analysis.md)   (3)
   - [Call Graph Analysis](data/papers/labels/call_graph_analysis.md)   (2)
   - [Data-flow Analysis](data/papers/labels/data-flow_analysis.md)   (8)
   - [Type Inference](data/papers/labels/type_inference.md)   (3)
-  - [Specification Inference](data/papers/labels/specification_inference.md)   (9)
+  - [Specification Inference](data/papers/labels/specification_inference.md)   (12)
   - [Equivalence Checking](data/papers/labels/equivalence_checking.md)   (1)
   - [Code Similarity Analysis](data/papers/labels/code_similarity_analysis.md)   (5)
-  - [Bug Detection](data/papers/labels/bug_detection.md)   (64)
+  - [Bug Detection](data/papers/labels/bug_detection.md)   (67)
   - [Program Verification](data/papers/labels/program_verification.md)   (19)
   - [Program Optimization](data/papers/labels/program_optimization.md)   (4)
   - [Program Decompilation](data/papers/labels/program_decompilation.md)   (8)
 
@@ -7442,6 +7442,37 @@
         ],
         "url": "https://www.usenix.org/system/files/usenixsecurity24-zhao.pdf"
     },
+    "When Threads Meet Interrupts: Effective Static Detection of Interrupt-Based Deadlocks in Linux": {
+        "type": "inproceedings",
+        "key": "chengfeng2024",
+        "title": "When Threads Meet Interrupts: Effective Static Detection of Interrupt-Based Deadlocks in Linux",
+        "author": "Chengfeng Ye, Yuandao Cai, and Charles Zhang,",
+        "booktitle": "33rd USENIX Security Symposium (USENIX Security 24)",
+        "year": "2024",
+        "venue": "USENIXSec2024",
+        "abstract": "Deadlocking is an unresponsive state of software that arises when threads hold locks while trying to acquire other locks that are already held by other threads, resulting in a circular lock dependency. Interrupt-based deadlocks, a specific and prevalent type of deadlocks that occur within the OS kernel due to interrupt preemption, pose significant risks to system functionality, performance, and security. However, existing static analysis tools focus on resource-based deadlocks without characterizing the interrupt preemption. In this paper, we introduce Archerfish, the first static analysis approach for effectively identifying interrupt-based deadlocks in the large-scale Linux kernel. At its core, Archerfish utilizes an Interrupt-Aware Lock Graph (ILG) to capture both regular and interrupt-related lock dependencies, reducing the deadlock detection problem to graph cycle discovery and refinement. Furthermore, Archerfish incorporates four effective analysis components to construct ILG and refine the deadlock cycles, addressing three core challenges, including the extensive interrupt-involving concurrency space, identifying potential interrupt handlers, and validating the feasibility of deadlock cycles. Our experimental results show that Archerfish can precisely analyze the Linux kernel (19.8 MLoC) in approximately one hour. At the time of writing, we have discovered 76 previously unknown deadlocks, with 53 bugs confirmed, 46 bugs already fixed by the Linux community, and 2 CVE IDs assigned. Notably, those found deadlocks are long-latent, hiding for an average of 9.9 years.",
+        "labels": [
+            "static analysis",
+            "bug detection",
+            "specification inference"
+        ],
+        "url": "https://www.usenix.org/system/files/usenixsecurity24-zhao.pdf"
+    },
+    "Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing": {
+        "type": "inproceedings",
+        "key": "Asmita2024",
+        "title": "Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing",
+        "author": "Asmita, Yaroslav Oliinyk,  Michael Scott, Ryan Tsang, Chongzhou Fang, and Houman Homayoun",
+        "booktitle": "33rd USENIX Security Symposium (USENIX Security 24)",
+        "year": "2024",
+        "venue": "USENIXSec2024",
+        "abstract": "BusyBox, an open-source software bundling over 300 essential Linux commands into a single executable, is ubiquitous in Linux-based embedded devices. Vulnerabilities in BusyBox can have far-reaching consequences, affecting a wide array of devices. This research, driven by the extensive use of BusyBox, delved into its analysis. The study revealed the prevalence of older BusyBox versions in real-world embedded products, prompting us to conduct fuzz testing on BusyBox. Fuzzing, a pivotal software testing method, aims to induce crashes that are subsequently scrutinized to uncover vulnerabilities. Within this study, we introduce two techniques to fortify software testing. The first technique enhances fuzzing by leveraging Large Language Models (LLM) to generate target-specific initial seeds. Our study showed a substantial increase in crashes when using LLM-generated initial seeds, highlighting the potential of LLM to efficiently tackle the typically labor-intensive task of generating target-specific initial seeds. The second technique involves repurposing previously acquired crash data from similar fuzzed targets before initiating fuzzing on a new target. This approach streamlines the time-consuming fuzz testing process by providing crash data directly to the new target before commencing fuzzing. We successfully identified crashes in the latest BusyBox target without conducting traditional fuzzing, emphasizing the effectiveness of LLM and crash reuse techniques in enhancing software testing and improving vulnerability detection in embedded systems. Additionally, manual triaging was performed to identify the nature of crashes in the latest BusyBox.",
+        "labels": [
+            "program testing",
+            "fuzzing"
+        ],
+        "url": "https://www.usenix.org/system/files/usenixsecurity24-asmita.pdf"
+    },
     "Gptscan: Detecting logic vulnerabilities in smart contracts by combining gpt with program analysis": {
         "type": "inproceedings",
         "key": "sun2024gptscan",
@@ -10045,6 +10076,8 @@
         ]
     },
     "Hierarchical Repository-Level Code Summarization for Business Applications Using Local LLMs": {
+        "type": "INPROCEEDINGS",
+        "key": "nilesh2025",
         "author": "Nilesh Dhulshette, Sapan Shah, Vinay Kulkarni",
         "title": "Hierarchical Repository-Level Code Summarization for Business Applications Using Local LLMs",
         "url": "https://arxiv.org/pdf/2501.07857",
@@ -10059,6 +10092,8 @@
         "venue": "arXiv2025"
     },
     "Utilizing Precise and Complete Code Context to Guide LLM in Automatic False Positive Mitigation": {
+        "type": "INPROCEEDINGS",
+        "key": "jinbao2024",
         "author": "Jinbao Chen, Hongjing Xiang, Luhao Li, Yu Zhang, Boyao Ding, Qingwei Li",
         "title": "Utilizing Precise and Complete Code Context to Guide LLM in Automatic False Positive Mitigation",
         "url": "https://arxiv.org/pdf/2411.03079",
@@ -10069,6 +10104,34 @@
         ],
         "venue": "arXiv2024"
     },
+    "Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications": {
+        "type": "INPROCEEDINGS",
+        "key": "hermes2024",
+        "author": "Abdullah Al Ishtiaq, Sarkar Snigdha Sarathi Das, Syed Md Mukit Rashid, Ali Ranjbar, Kai Tu, Tianwei Wu, Zhezheng Song, Weixuan Wang, Mujtahid Akon, Rui Zhang, Syed Rafiul Hussain",
+        "title": "Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications",
+        "url": "https://arxiv.org/abs/2310.04381",
+        "abstract": "In this paper, we present Hermes, an end-to-end framework to automatically generate formal representations from natural language cellular specifications. We first develop a neural constituency parser, NEUTREX, to process transition-relevant texts and extract transition components (i.e., states, conditions, and actions). We also design a domain-specific language to translate these transition components to logical formulas by leveraging dependency parse trees. Finally, we compile these logical formulas to generate transitions and create the formal model as finite state machines. To demonstrate the effectiveness of Hermes, we evaluate it on 4G NAS, 5G NAS, and 5G RRC specifications and obtain an overall accuracy of 81-87%, which is a substantial improvement over the state-of-the-art. Our security analysis of the extracted models uncovers 3 new vulnerabilities and identifies 19 previous attacks in 4G and 5G specifications, and 7 deviations in commercial 4G basebands.",
+        "labels": [
+          "static analysis",
+          "bug detection",
+          "specification inference"
+        ],
+        "venue": "USENIXSec2024"
+    },
+    "CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications": {
+        "type": "INPROCEEDINGS",
+        "key": "CellularLint2024",
+        "author": "Mirza Masfiqur Rahman, Imtiaz Karim, and Elisa Bertino",
+        "title": "CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications",
+        "url": "https://www.usenix.org/system/files/usenixsecurity24-rahman.pdf",
+        "abstract": "In recent years, there has been a growing focus on scrutinizing the security of cellular networks, often attributing security vulnerabilities to issues in the underlying protocol design descriptions. These protocol design specifications, typically extensive documents that are thousands of pages long, can harbor inaccuracies, underspecifications, implicit assumptions, and internal inconsistencies. In light of the evolving landscape, we introduce CellularLint—a semi-automatic framework for inconsistency detection within the standards of 4G and 5G, capitalizing on a suite of natural language processing techniques. Our proposed method uses a revamped few-shot learning mechanism on domain-adapted large language models. Pre-trained on a vast corpus of cellular network protocols, this method enables CellularLint to simultaneously detect inconsistencies at various levels of semantics and practical use cases. In doing so, CellularLint significantly advances the automated analysis of protocol specifications in a scalable fashion. In our investigation, we focused on the Non-Access Stratum (NAS) and the security specifications of 4G and 5G networks, ultimately uncovering 157 inconsistencies with 82.67% accuracy. After verification of these inconsistencies on open-source implementations and 17 commercial devices, we confirm that they indeed have a substantial impact on design decisions, potentially leading to concerns related to privacy, integrity, availability, and interoperability.",
+        "labels": [
+          "static analysis",
+          "bug detection",
+          "specification inference"
+        ],
+        "venue": "USENIXSec2024"
+    },
     "C2SaferRust: Transforming C Projects into Safer Rust with NeuroSymbolic Techniques": {
         "author": "Vikram Nitin, Rahul Krishna, Luiz Lemos do Valle, Baishakhi Ray",
         "title": "C2SaferRust: Transforming C Projects into Safer Rust with NeuroSymbolic Techniques",
 
@@ -30,6 +30,12 @@
   - **Labels**: [static analysis](static_analysis.md), [bug detection](bug_detection.md)
 
 
+- [CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications](../venues/USENIXSec2024/paper_6.md), ([USENIXSec2024](../venues/USENIXSec2024/README.md))
+
+  - **Abstract**: In recent years, there has been a growing focus on scrutinizing the security of cellular networks, often attributing security vulnerabilities to issues in the underlying protocol design descriptions. These protocol design specifications, typically extensive documents that are thousands of pages long, can harbor inaccuracies, underspecifications, implicit assumptions, and internal inconsistencies. In light of the evolving landscape, we introduce CellularLint—a semi-automatic framework for inconsi...
+  - **Labels**: [static analysis](static_analysis.md), [bug detection](bug_detection.md), [specification inference](specification_inference.md)
+
+
 - [Closing the Gap: A User Study on the Real-world Usefulness of AI-powered Vulnerability Detection & Repair in the IDE](../venues/ICSE2025/paper_1.md), ([ICSE2025](../venues/ICSE2025/README.md))
 
   - **Abstract**: This paper presents the first empirical study of a vulnerability detection and fix tool with professional software developers on real projects that they own. We implemented DeepVulGuard, an IDE-integrated tool based on state-of-the-art detection and fix models, and show that it has promising performance on benchmarks of historic vulnerability data. DeepVulGuard scans code for vulnerabilities (including identifying the vulnerability type and vulnerable region of code), suggests fixes, provides na...
@@ -144,6 +150,12 @@
   - **Labels**: [static analysis](static_analysis.md), [bug detection](bug_detection.md)
 
 
+- [Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications](../venues/USENIXSec2024/paper_5.md), ([USENIXSec2024](../venues/USENIXSec2024/README.md))
+
+  - **Abstract**: In this paper, we present Hermes, an end-to-end framework to automatically generate formal representations from natural language cellular specifications. We first develop a neural constituency parser, NEUTREX, to process transition-relevant texts and extract transition components (i.e., states, conditions, and actions). We also design a domain-specific language to translate these transition components to logical formulas by leveraging dependency parse trees. Finally, we compile these logical for...
+  - **Labels**: [static analysis](static_analysis.md), [bug detection](bug_detection.md), [specification inference](specification_inference.md)
+
+
 - [How Far Have We Gone in Vulnerability Detection Using Large Language Models](../venues/arXiv2023/paper_5.md), ([arXiv2023](../venues/arXiv2023/README.md))
 
   - **Abstract**: As software becomes increasingly complex and prone to vulnerabilities, automated vulnerability detection is critically important, yet challenging. Given the significant successes of large language models (LLMs) in various tasks, there is growing anticipation of their efficacy in vulnerability detection. However, a quantitative understanding of their potential in vulnerability detection is still missing. To bridge this gap, we introduce a comprehensive vulnerability benchmark VulBench. This bench...
@@ -360,6 +372,12 @@
   - **Labels**: [static analysis](static_analysis.md), [bug detection](bug_detection.md), [benchmark](benchmark.md)
 
 
+- [When Threads Meet Interrupts: Effective Static Detection of Interrupt-Based Deadlocks in Linux](../venues/USENIXSec2024/paper_3.md), ([USENIXSec2024](../venues/USENIXSec2024/README.md))
+
+  - **Abstract**: Deadlocking is an unresponsive state of software that arises when threads hold locks while trying to acquire other locks that are already held by other threads, resulting in a circular lock dependency. Interrupt-based deadlocks, a specific and prevalent type of deadlocks that occur within the OS kernel due to interrupt preemption, pose significant risks to system functionality, performance, and security. However, existing static analysis tools focus on resource-based deadlocks without characteri...
+  - **Labels**: [static analysis](static_analysis.md), [bug detection](bug_detection.md), [specification inference](specification_inference.md)
+
+
 - [Where is it? Tracing the Vulnerability-relevant Files from Vulnerability Reports](../venues/ICSE2024/paper_18.md), ([ICSE2024](../venues/ICSE2024/README.md))
 
   - **Abstract**: With the widely usage of open-source software, supply-chain-based vulnerability attacks, including SolarWind and Log4Shell, have posed significant risks to software security. Currently, people rely on vulnerability advisory databases or commercial software bill of materials (SBOM) to defend against potential risks. Unfortunately, these datasets do not provide finer-grained file-level vulnerability information, compromising their effectiveness. Previous works have not adequately addressed this is...
 
@@ -24,6 +24,12 @@
   - **Labels**: [program testing](program_testing.md), [fuzzing](fuzzing.md)
 
 
+- [Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing](../venues/USENIXSec2024/paper_4.md), ([USENIXSec2024](../venues/USENIXSec2024/README.md))
+
+  - **Abstract**: BusyBox, an open-source software bundling over 300 essential Linux commands into a single executable, is ubiquitous in Linux-based embedded devices. Vulnerabilities in BusyBox can have far-reaching consequences, affecting a wide array of devices. This research, driven by the extensive use of BusyBox, delved into its analysis. The study revealed the prevalence of older BusyBox versions in real-world embedded products, prompting us to conduct fuzz testing on BusyBox. Fuzzing, a pivotal software te...
+  - **Labels**: [program testing](program_testing.md), [fuzzing](fuzzing.md)
+
+
 - [Fuzzing JavaScript Interpreters with Coverage-Guided Reinforcement Learning for LLM-Based Mutation](../venues/ISSTA2024/paper_22.md), ([ISSTA2024](../venues/ISSTA2024/README.md))
 
   - **Abstract**: JavaScript interpreters, crucial for modern web browsers, require an effective fuzzing method to identify security-related bugs. However, the strict grammatical requirements for input present significant challenges. Recent efforts to integrate language models for context- aware mutation in fuzzing are promising but lack the necessary coverage guidance to be fully effective. This paper presents a novel technique called CovRL (Coverage-guided Reinforcement Learning) that combines Large Language Mo...