Hybrid CNN–LSTM Models for Cross-Project Fault Prediction: Robust Generalization Under Dataset Shift

Authors

  • Sidharth Rao Computer Science Department, SRM Institute of Science Department and Technology, Chennai, Tamil Nadu, India. Author
  • Arjun Patel Computer Science Department, SRM Institute of Science Department and Technology, Chennai, Tamil Nadu, India. Author
  • Neha Gupta Information Technology, SRM Institute of Science Department and Technology, Chennai, Tamil Nadu, India. Author
  • Kavya Shah Information Technology, SRM Institute of Science Department and Technology, Chennai, Tamil Nadu, India. Author

DOI:

https://doi.org/10.63282/3050-9262.IJAIDSML-V6I4P133

Keywords:

Cross-Project Fault Prediction, Dataset Shift, Domain Generalization, Hybrid Deep Learning, CNN, LSTM, Software Defect Prediction, Ci/Cd Analytics, Robustness, Explainable AI

Abstract

Abstract: Cross-project fault prediction (CPFP) aims to learn defect predictors from one or more source projects and deploy them on a different target project where labels may be unavailable, incomplete, delayed, or noisy. Despite decades of progress in software defect prediction, generalization in CPFP remains brittle because target projects commonly exhibit dataset shift: changes in metric distributions, process behaviors, architectural patterns, operational platforms, and data pipelines that violate the implicit i.i.d. assumptions of standard supervised learning. This manuscript proposes a shift-aware hybrid CNN–LSTM framework tailored to CPFP under realistic dataset shift. The model combines convolutional feature extractors (for local, interaction-level patterns among engineered metrics or change descriptors) with recurrent sequence modeling (for temporal and cross-change dependencies), and integrates training strategies that stabilize performance under covariate and conditional shift. We formalize the CPFP learning objective under shift, define a practical shift taxonomy grounded in modern enterprise software contexts, and introduce a training pipeline using robust scaling, project-conditional normalization, and shift-mitigation regularization. The approach is designed for operational feasibility: it supports constrained edge environments and embedded telemetry streams, aligns with CI/CD governance processes, and incorporates explainability requirements for high-stakes domains. A worked example illustrates how dataset shift can invert model rankings across projects and how shift-aware training reduces this volatility. The manuscript concludes with a reproducible evaluation protocol, threat analysis, and deployment guidance for fault prediction in cloud-native and regulated settings.

References

[1] J. Quiñonero-Candela, M. Sugiyama, A. Schwaighofer, and N. D. Lawrence, Dataset Shift in Machine Learning. Cambridge, MA, USA: MIT Press, 2009.

[2] S. J. Pan and Q. Yang, “A survey on transfer learning,” IEEE Trans. Knowl. Data Eng., vol. 22, no. 10, pp. 1345–1359, Oct. 2010.

[3] T. Zimmermann, N. Nagappan, H. Gall, E. Giger, and B. Murphy, “Cross-project defect prediction: a large-scale experiment on data from one project,” in Proc. 7th Joint Meeting Eur. Softw. Eng. Conf. ACM SIGSOFT Symp. Found. Softw. Eng. (ESEC/FSE), 2009, pp. 91–100.

[4] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proc. IEEE, vol. 86, no. 11, pp. 2278–2324, Nov. 1998.

[5] S. K. Gunda, "Analyzing Machine Learning Techniques for Software Defect Prediction: A Comprehensive Performance Comparison," 2024 Asian Conference on Intelligent Technologies (ACOIT), KOLAR, India, 2024, pp. 1-5, https://doi.org/10.1109/ACOIT62457.2024.10939610.

[6] I. Manga, “Edge Software Engineering for Lightweight AI: Real-Time Environmental Data Processing with Embedded Systems,” Journal of Computational Analysis and Applications (JoCAAA), vol. 34, no. 6, pp. 88–104, Jun. 2025.

[7] S. R. Gudi, “Enhancing Reliability in Java Enterprise Systems through Comparative Analysis of Automated Testing Frameworks,” International Journal of Emerging Trends in Computer Science and Information Technology, vol. 4, no. 2, pp. 151–160, 2023, doi: 10.63282/3050-9246.IJETCSIT-V4I2P115.

[8] V. K. Reddy Mittamidi, “Leveraging AI and ML for Predictive Monitoring and Error Mitigation in Change Data Capture Pipelines,” IJETCSIT, vol. 6, no. 3, pp. 104–111, Aug. 21, 2025. https://ijetcsit.org/index.php/ijetcsit/article/view/515

[9] S. K. Gunda, “Comparative Analysis of Machine Learning Models for Software Defect Prediction,” in Proc. Int. Conf. Power, Energy, Control and Transmission Systems (ICPECTS), Chennai, India, 2024, pp. 1–6, doi: 10.1109/ICPECTS62210.2024.10780167.

[10] I. Manga, “AutoML for All: Democratizing Machine Learning Model Building with Minimal Code Interfaces,” in Proc. 3rd Int. Conf. Sustainable Computing and Data Communication Systems (ICSCDS), Erode, India, 2025, pp. 347–352, doi: 10.1109/ICSCDS65426.2025.11167529.

[11] R. R. Thalakanti and S. S. Goud Bandari, “Intelligent Continuous Integration and Delivery for Banking Systems using Machine Learning Driven Risk Detection with Real World Deployment Evaluation,” International Journal of AI, BigData, Computational and Management Studies, vol. 5, no. 4, pp. 168–175, 2024, doi: 10.63282/3050-9416.IJAIBDCMS-V5I4P118.

[12] I. Manga, “Unified Data Engineering for Smart Mobility: Real-Time Integration of Traffic, Public Transport, and Environmental Data,” in Proc. 5th Int. Conf. Soft Computing for Security Applications (ICSCSA), Salem, India, 2025, pp. 1348–1353, doi: 10.1109/ICSCSA66339.2025.11170800.

[13] S. S. G. Bandari, S. D. Sivva, and R. R. Thalakanti, “Regulatory Grade Fraud Detection using Explainable Artificial Intelligence with Auditable Decision Pathways and Empirical Validation on Banking Data,” International Journal of Artificial Intelligence, Data Science, and Machine Learning, vol. 5, no. 3, pp. 139–147, 2024, doi: 10.63282/3050-9262.IJAIDSML-V5I3P115.

[14] I. Manga, “Towards Explainable AI: A Framework for Interpretable Deep Learning in High-Stakes Domains,” in Proc. 5th Int. Conf. Soft Computing for Security Applications (ICSCSA), Salem, India, 2025, pp. 1354–1360, doi: 10.1109/ICSCSA66339.2025.11170778.

[15] S. R. Gudi, “AI-Driven Fax-to-Digital Prescription Automation: A Cloud-Native Framework Using OCR, Machine Learning, and Microservices for Pharmacy Operations,” International Journal of Emerging Research in Engineering and Technology, vol. 5, no. 1, pp. 111–116, 2024, doi: 10.63282/3050-922X.IJERET-V5I1P113.

[16] G. V. Krishna, B. D. Reddy, and T. Vrindaa, “EmoVision: An Intelligent Deep Learning Framework for Emotion Understanding and Mental Wellness Assistance in Human Computer Interaction,” IJAIDSML, vol. 6, no. 4, pp. 14–20, Oct. 2025. https://ijaidsml.org/index.php/ijaidsml/article/view/295

[17] S. K. Gunda, S. D. R. Yettapu, S. Bodakunti, and S. B. Bikki, “Decision Intelligence Methodology for AI-Driven Agile Software Lifecycle Governance and Architecture-Centered Project Management,” IJAIDSML, vol. 4, no. 1, pp. 102–108, Mar. 30, 2023, doi: 10.63282/3050-9262.IJAIDSML-V4I1P112.

[18] I. Manga, “Federated Learning at Scale: A Privacy-Preserving Framework for Decentralized AI Training,” in Proc. 5th Int. Conf. Soft Computing for Security Applications (ICSCSA), Salem, India, 2025, pp. 110–115, doi: 10.1109/ICSCSA66339.2025.11170780.

[19] R. R. Thalakanti, S. S. Goud Bandari, and S. D. Sivva, “Federated Learning for Privacy Preserving Fraud Detection across Financial Institutions: Architecture Protocols and Operational Governance,” International Journal of Emerging Research in Engineering and Technology, vol. 5, no. 2, pp. 108–114, 2024, doi: 10.63282/3050-922X.IJERET-V5I2P111.

[20] S. R. Gudi, “Design and Evaluation of Secure Microservices Architecture for HIPAA-Compliant Prescription Processing on AWS and OpenShift,” International Journal of Artificial Intelligence, Data Science, and Machine Learning, vol. 5, no. 2, pp. 144–149, 2024, doi: 10.63282/3050-9262.IJAIDSML-V5I2P116.

[21] S. K. RaoVanama, “AI-Augmented CI/CD Pipeline Optimization for Scalable Cloud-Native Deployment,” IJAIDSML, vol. 5, no. 4, pp. 175–187, Dec. 30, 2024. https://ijaidsml.org/index.php/ijaidsml/article/view/368

[22] S. K. Gunda, “A Deep Dive into Software Fault Prediction: Evaluating CNN and RNN Models,” in Proc. Int. Conf. Electronic Systems and Intelligent Computing (ICESIC), Chennai, India, 2024, pp. 224–228, doi: 10.1109/ICESIC61777.2024.10846549.

[23] T. Raikar and V. Apelagunta, “Implementing SAP Fiori in S/4HANA transitions: Key guidelines, challenges, strategic implications, AI integration recommendations,” Journal of Engineering Research and Sciences, vol. 4, no. 11, pp. 1–9, 2025, doi: 10.55708/JS0411001.

[24] I. Manga, “Scalable Graph Neural Networks for Global Knowledge Representation and Reasoning,” in Proc. 9th Int. Conf. Inventive Systems and Control (ICISC), Coimbatore, India, 2025, pp. 1399–1404, doi: 10.1109/ICISC65841.2025.11188341.

[25] S. R. Gudi, “Leveraging Predictive Analytics and Redis-Backed Caching to Optimize Specialty Medication Fulfillment and Pharmacy Inventory Management,” International Journal of AI, BigData, Computational and Management Studies, vol. 5, no. 3, pp. 155–160, 2024, doi: 10.63282/3050-9416.IJAIBDCMS-V5I3P116.

[26] V. K. Reddy Mittamidi, “AI/ML Powered Intelligent Root Cause Analysis and Automated Remediation for Multi System Data Integrity Issues,” IJAIBDCMS, vol. 6, no. 4, pp. 133–141, Nov. 14, 2025. https://ijaibdcms.org/index.php/ijaibdcms/article/view/338

[27] S. K. Gunda, “The Future of Software Development and the Expanding Role of ML Models,” International Journal of Emerging Research in Engineering and Technology, vol. 4, no. 2, pp. 126–129, 2023, doi: 10.63282/3050-922X.IJERET-V4I2P113.

[28] T. Raikar, “High-Performance In-Memory Computing: A Research Study on SAP S/4 HANA Database Layer,” American Journal of Technology, vol. 4, no. 2, pp. 93–113, 2025, doi: 10.58425/ajt.v4i2.449.

[29] S. R. Gudi, “Ensuring Secure and Compliant Fax Communication: Anomaly Detection and Encryption Strategies for Data in Transit,” in Proc. 4th Int. Conf. Innovative Mechanisms for Industry Applications (ICIMIA), Tirupur, India, 2025, pp. 786–791, doi: 10.1109/ICIMIA67127.2025.11200537.

[30] K. V. Alluri, “Using Salesforce CRM and Deep Learning (CNN) Techniques to Improve Patient Journey Mapping and Engagement in Small and Medium Healthcare Organizations,” IJAIDSML, vol. 6, no. 4, pp. 101–109, Nov. 22, 2025. https://ijaidsml.org/index.php/ijaidsml/article/view/330

[31] S. R. Gudi, “A Comparative Analysis of Pivotal Cloud Foundry and OpenShift Cloud Platforms,” The American Journal of Applied Sciences, vol. 7, no. 07, pp. 20–29, 2025, doi: 10.37547/tajas/Volume07Issue07-03.

[32] R. R. Thalakanti, “Enhancing Convergence in Fully Connected Neural Networks via Optimized Backpropagation,” in Proc. 2nd Int. Conf. Computing and Data Science (ICCDS), Chennai, India, 2025, pp. 1–6, doi: 10.1109/ICCDS64403.2025.11209625.

[33] S. R. Gudi, “Monitoring and Deployment Optimization in Cloud-Native Systems: A Comparative Study Using OpenShift and Helm,” in Proc. 4th Int. Conf. Innovative Mechanisms for Industry Applications (ICIMIA), Tirupur, India, 2025, pp. 792–797, doi: 10.1109/ICIMIA67127.2025.11200594.

[34] Khokrale, R. (2025). The Role of AI in Supply Chain Optimization: Enhancing Efficiency through Predictive Analytics. Journal of Procurement and Supply Chain Management, 4(2), 55–75. https://doi.org/10.58425/jpscm.v4i2.442

[35] S. R. Gudi, “Deconstructing Monoliths: A Fault-Aware Transition to Microservices with Gateway Optimization using Spring Cloud,” in Proc. 6th Int. Conf. Electronics and Sustainable Communication Systems (ICESC), Coimbatore, India, 2025, pp. 815–820, doi: 10.1109/ICESC65114.2025.11212326.

[36] K. V. Alluri, “Salesforce CRM Framework for Real Time DeFi Portfolio Intelligence and Customer Engagement Forecasting in Web3 Based Decentralized Finance Ecosystems Using ML Techniques,” IJAIBDCMS, vol. 6, no. 4, pp. 99–107, Nov. 6, 2025. https://ijaibdcms.org/index.php/ijaibdcms/article/view/319

[37] S. R. Gudi, “Enhancing optical character recognition (OCR) accuracy in healthcare prescription processing using artificial neural networks,” European Journal of Artificial Intelligence and Machine Learning, vol. 4, no. 6, 2025, doi: 10.24018/ejai.2025.4.6.79.

[38] Sivva SD, Thalakanti RR, Bandari SSG, Yettapu SDR. AI-Driven Decision Intelligence for Agile Software Lifecycle Governance: An Architecture-Centered Framework Integrating Machine Learning Defect Prediction and Automated Testing. IJETCSIT 2023 Dec.30;4(4):167-72. Available from: https://ijetcsit.org/index.php/ijetcsit/article/view/554.

Published

2025-12-30

Issue

Section

Articles

How to Cite

1.
Rao S, Patel A, Gupta N, Shah K. Hybrid CNN–LSTM Models for Cross-Project Fault Prediction: Robust Generalization Under Dataset Shift. IJAIDSML [Internet]. 2025 Dec. 30 [cited 2026 Mar. 9];6(4):241-5. Available from: https://ijaidsml.org/index.php/ijaidsml/article/view/455