Computer Vision Conference (CVC) 2026
16-17 April 2026
Publication Links
IJACSA
Special Issues
Future of Information and Communication Conference (FICC)
Computing Conference
Intelligent Systems Conference (IntelliSys)
Future Technologies Conference (FTC)
International Journal of Advanced Computer Science and Applications(IJACSA), Volume 16 Issue 5, 2025.
Abstract: Recent approaches to binary–source code matching often operate at the intermediate representation (IR) level, with some applying the matching process at the binary level by compiling the source code to binary and then matching it directly with the binary code. Others, though less common, perform matching at the decompiler-generated pseudo-code level by first decompiling the binary code into pseudo-code and then comparing it with the source code. However, all these approaches are limited by the loss of semantic information in the original source code and the introduction of noise during compilation and decompilation, making accurate matching challenging and often requiring specialized expertise. To address these limitations, this study introduces a system for binary–source code matching based on decompilation techniques and Graph analysis (BSMDG) that matches binary code with source code at the source code level. Our method utilizes the Ghidra decompiler in conjunction with a custom-built transpiler to reconstruct high-level C++ source code from binary executables. Subsequently, call graphs (CGs) and control flow graphs (CFGs) are generated for both the original and translated code to evaluate their structural and semantic similarities. To evaluate our system, we used a curated dataset of C++ source code and corresponding binary files collected from the AtCoder website for training and testing. Additionally, a case study was conducted using the widely recognized POJ-104 benchmark dataset to assess the system's generalizability. The results demonstrate the effectiveness of combining decompilation with graph-based analysis, with our system achieving 90% accuracy on POJ-104, highlighting its potential in code clone detection, vulnerability identification, and reverse engineering tasks.
Ghader Aljebreen, Reem Alnanih, Fathy Eassa, Maher Khemakhem, Kamal Jambi and Muhammed Usman Ashraf, “Binary–Source Code Matching Based on Decompilation Techniques and Graph Analysis” International Journal of Advanced Computer Science and Applications(IJACSA), 16(5), 2025. http://dx.doi.org/10.14569/IJACSA.2025.0160525
@article{Aljebreen2025,
title = {Binary–Source Code Matching Based on Decompilation Techniques and Graph Analysis},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2025.0160525},
url = {http://dx.doi.org/10.14569/IJACSA.2025.0160525},
year = {2025},
publisher = {The Science and Information Organization},
volume = {16},
number = {5},
author = {Ghader Aljebreen and Reem Alnanih and Fathy Eassa and Maher Khemakhem and Kamal Jambi and Muhammed Usman Ashraf}
}
Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.