Comparative Study of Machine Learning Techniques for Insurance Fraud Detection

Navin Duwadi, Anita Sharma

Submitted : 2024-07-11, Published : 2024-08-06.

Abstract

Insurance fraud has been a constant presence in the realm of insurance. However, as strategies and methods for committing insurance fraud have evolved, the frequency and volume of such fraudulent activities have also increased. An example of this is vehicle insurance fraud, which involves collaborating to fabricate false or exaggerated claims related to property damage or personal injuries resulting from an accident. Machine learning techniques seems to be more beneficial and great way to address the fraud in the insurance industry. This paper comprehensively examines existing research through a systematic literature review. This review aims to identify previously attempted approaches and evaluate which machine learning algorithm is best suited for this specific problem. This paper proposes a methodology for identifying fraudulent insurance claims. This approach can significantly improve efficiency and cost savings for insurance companies in handling such cases. The most popular traditional machine learning algorithms used to identify insurance fraud in the auto industry were found to be support vector machine, logistic regression, and random forest.

Keywords

Machine learning, support vector machine, random forest, logistic regression

Full Text:

PDF

References

J. West, M. Bhattacharya, R. Islam, “Intelligent Financial Fraud Detection Practices: An Investigation,” in International Conference on Security and Privacy in Communication Networks, pp. 186–203, 2015. https://doi.org/10.1007/978-3-319-23802-9_16

A. M. Caldeira, W. Gassenferth, M. A. S. Machado, D. J. Santos, “Auditing vehicles claims using neural networks,” in Procedia Computer Science, vol. 55, pp. 62–71, 2015. https://doi.org/10.1016/j.procs.2015.07.008

M. Kirlidog, C. Asuk, “A Fraud Detection Approach with Data Mining in Health Insurance,” Procedia - Social Behavioral Sciences, vol. 62, pp. 989–994, 2012. https://doi.org/10.1016/j.sbspro.2012.09.168

V. Rawte, G. Anuradha, “Fraud detection in health insurance using data mining techniques,” 2015 International Conference on Communication, Information & Computing Technology (ICCICT), Jan. 2015. https://doi.org/10.1109/ICCICT.2015.7045689

M. Al Marri, A. AlAli, “Financial Fraud Detection using Machine Learning Techniques,” RIT Digital Institutional Repository, Rochester Institute of Technology, Dubai, 2020.

K. Nian, H. Zhang, A. Tayal, T. Coleman, Y. Li, “Auto insurance fraud detection using unsupervised spectral ranking for anomaly,” The Journal of Finance and Data Science, vol. 2, no. 1, pp. 58–75, Mar. 2016. https://doi.org/10.1016/j.jfds.2016.03.001

Y. Wang, W. Xu, “Leveraging deep learning with LDA-based text analytics to detect automobile insurance fraud,” Decision Support Systems, vol. 105, pp. 87–95, 2018. https://doi.org/10.1016/j.dss.2017.11.001

J. O. Awoyemi, A. O. Adetunmbi, S. A. Oluwadare, “Credit card fraud detection using Machine Learning Techniques: A Comparative Analysis,” 2017 International Conference on Computing Networking and Informatics (ICCNI), Oct. 2017. https://doi.org/10.1109/ICCNI.2017.8123782

M. K. Severino, Y. Peng, “Machine learning algorithms for fraud prediction in property insurance: Empirical evidence using real-world microdata,” Machine Learning with Applications, vol. 5, p. 100074, 2021. https://doi.org/10.1016/j.mlwa.2021.100074

A. Abdallah, M. A. Maarof, A. Zainal, “Fraud detection system: A survey,” Journal of Network and Computer Applications, vol. 68, pp. 90–113, 2016. https://doi.org/10.1016/j.jnca.2016.04.007

M. A. Caruana and L. Grech, “Automobile insurance fraud detection,” Communications in Statistics: Case Studies, Data Analysis and Applications, vol. 7, no. 4, pp. 520–535, 2021. https://doi.org/10.1080/23737484.2021.1986169

V. ambatipudi, “Machine Learning models for Automobile Fraud Detection - A literature Review Agenda,” 20th Global Conference of Actuaries, 2019.

R. Bhowmik, “Detecting Auto Insurance Fraud by Data Mining Techniques,” Journal of Emerging Trends in Computing and Information Sciences, vol. 2, no. 4, pp. 156–162, 2011.

J. Brownlee, “Tune hyperparameters for Classification Machine Learning Algorithms,” Machine Learning Mastery, 2020, [Online] Available: https://machinelearningmastery.com/hyperparameters-for-classification-machine-learning-algorithms, accessed Jul. 8, 2024.

Sheethal H. D., P. Sai Pranavi, Sharanya S. Kumar, Sonika Kariappa, Swathi B. H. Gururaj H. L., “Comparative analysis on vehicle insurances fraud detection using machine learning,” International Journal of Advance Research, Ideas and Innovations in Technology, vol. 6, no. 3, 2020.

P. Dua, S. Bais, “Supervised learning methods for fraud detection in healthcare insurance,” Machine Learning in Healthcare Informatics, vol. 56, pp. 261–285, 2014. https://doi.org/10.1007/978-3-642-40017-9_12

R. Y. Gupta, S. S. Mudigonda, P. K. Baruah, “A comparative study of using various machine learning and deep learning-based fraud detection models for universal health coverage schemes,” International Journal of Engineering Trends and Technology (IJETT), vol. 69, no. 3, pp. 96–102, 2021. https://doi.org/10.14445/22315381/IJETT-V69I3P216

J. Pesantez-Narvaez, M. Guillen, M. Alcañiz, “Predicting motor insurance claims using telematics data—XGboost versus logistic regression,” Risks, vol. 7, no. 2, 2019. https://doi.org/10.3390/risks7020070

G. Kowshalya, M. Nandhini, “Predicting Fraudulent Claims in Automobile Insurance,” 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT), Coimbatore, India, pp. 1338-1343, 2018. https://doi.org/10.1109/ICICCT.2018.8473034

T. Badriyah, L. Rahmaniah, I. Syarif, “Nearest Neighbour and Statistics Method based for Detecting Fraud in Auto Insurance,” 2018 International Conference on Applied Engineering (ICAE), Batam, Indonesia, pp. 1-5, 2018. https://doi.org/10.1109/INCAE.2018.8579155

S. Subudhi, S. Panigrahi, “Use of possibilistic fuzzy C-means clustering for telecom fraud detection,” Computational Intelligence in Data Mining, pp. 633–641, 2017. doi: https://doi.org/10.1007/978-981-10-3874-7_60

R. A. Bauder, T. M. Khoshgoftaar, “Medicare Fraud Detection Using Machine Learning Methods,” 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), Cancun, Mexico, pp. 858-865, 2017. https://doi.org/10.1109/ICMLA.2017.00-48

R. Roy, K. T. George, “Detecting insurance claims fraud using machine learning techniques,” 2017 International Conference on Circuit ,Power and Computing Technologies (ICCPCT), Kollam, India, pp. 1-6, 2017. https://doi.org/10.1109/ICCPCT.2017.8074258

B. Itri, Y. Mohamed, Q. Mohammed, B. Omar, “Performance comparative study of machine learning algorithms for automobile insurance fraud detection,” 2019 Third International Conference on Intelligent Computing in Data Sciences (ICDS), Marrakech, Morocco, pp. 1-4, 2019. https://doi.org/10.1109/ICDS47004.2019.8942277

Y. Kumar, S. Saini, R. Payal, Y. Kumar, A. Professor, “Comparative Analysis for Fraud Detection using Logistic Regression, Random Forest and Support Vector Machine,” International Journal of Research and Analytical Reviews (IJRAR), vol. 7, no. 4, 2020. http://dx.doi.org/10.2139/ssrn.3751339

M. Mathew, N. M. Kunjumon, R. Maria Lalji, K. Susan Skariah, “Motor Insurance Claim Processing and Detection of Fraudulent Claims Using Machine Learning,” International Journal of Future Generation Communication and Networking, vol. 13, no. 3, pp. 1855–1860, 2020.

Y. Li, C. Yan, W. Liu, M. Li, “A principle component analysis-based random forest with the potential nearest neighbor method for automobile insurance fraud identification,” Applied Soft Computing, vol. 70, pp. 1000–1009, 2018. https://doi.org/10.1016/j.asoc.2017.07.027

A. Sheshasaayee, S. S. Thomas, “A purview of the impact of supervised learning methodologies on health insurance fraud detection,” in Advances in Intelligent Systems and Computing, vol. 672, pp. 978–984, 2018. http://dx.doi.org/10.1007/978-981-10-7512-4_98

D. Vineela, P. Swathi, T. Sritha, K. Ashesh, “Fraud Detection in Health Insurance Claims using Machine Learning Algorithms,” International Journal of Recent Technology and Engineering (IJRTE), vol. 8, no. 5, pp. 2999–3003, 2020. http://dx.doi.org/10.35940/ijrte.e6485.018520

G. G. Sundarkumar, V. Ravi, V. Siddeshwar, “One-class support vector machine based undersampling: Application to churn prediction and insurance fraud detection,” 2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), 2015. https://doi.org/10.1109/ICCIC.2015.7435726

S. Subudhi, S. Panigrahi, “Detection of automobile insurance fraud using feature selection and data mining techniques,” International Journal of Rough Sets and Data Analysis, vol. 5, no. 3, pp. 1–20, Jul. 2018. http://dx.doi.org/10.4018/IJRSDA.2018070101

C. Muranda, A. Ali, T. Shongwe, “Detecting Fraudulent Motor Insurance Claims Using Support Vector Machines with Adaptive Synthetic Sampling Method,” ITMS 2021 - 2021 62nd International Scientific Conference on Information Technology and Management Science of Riga Technical University, 2021.

T. Miyato, S.-I. Maeda, M. Koyama, S. Ishii, “Virtual adversarial training: A regularization method for supervised and semi-supervised learning,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 41, no. 8, pp. 1979–1993, Aug. 2019. https://doi.org/10.1109/TPAMI.2018.2858821

M. Artís, M. Ayuso, M. Guillén, “Detection of automobile insurance fraud with discrete choice models and misclassified claims,” Journal of Risk and Insurance, vol. 69, no. 3, pp. 325–340, 2002. https://doi.org/10.1111/1539-6975.00022

R. R. Popat, J. Chaudhary, "A Survey on Credit Card Fraud Detection Using Machine Learning," 2018 2nd International Conference on Trends in Electronics and Informatics (ICOEI), Tirunelveli, India, pp. 1120-1125, 2018. https://doi.org/10.1109/ICOEI.2018.8553963

A. Kamil, I. Hassan, A. Abraham, “Modeling Insurance Fraud Detection Using Ensemble Combining Classification,” International Journal of Computer Information Systems and Industrial Management Applications, vol. 8, pp. 257–265, 2016.

J. M. Johnson, T. M. Khoshgoftaar, “Medicare fraud detection using neural networks,” Journal of Big Data, vol. 6, no. 1, Dec. 2019. https://doi.org/10.1186/s40537-019-0225-0

S. Bansal, “Vehicle Insurance Claim Fraud Detection,” Kaggle, 2021. [Online] Available: https://www.kaggle.com/datasets/shivamb/vehicle-claim-fraud-detection/data, accessed Jul. 9, 2024.

Article Metrics

Abstract view: 525 times
Download     : 292   times

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.