JOURNAL ARTICLE

Crime status prediction using ensemble learning

Sanjay JainSingh Prashant

Year: 2024 Journal:   i-manager s Journal on Information Technology Vol: 13 (1)Pages: 28-28

Abstract

This paper focuses on crime status prediction through an ensemble methodology applied to extensive datasets obtained from catalog.data.gov, specifically targeting Los Angeles crime incidents since 2020. The research methodology comprises meticulous data collection, rigorous preprocessing, exploratory data analysis, model selection, and comprehensive model evaluation. Initial challenges included data inaccuracies and privacy-preserving measures in location data, necessitating thorough cleaning and transformation processes. Exploratory data analysis revealed crucial insights, including the 'Status' attribute's limited correlation, crime code distributions, areawise crime counts, and temporal patterns. To address class imbalance within 'Status', the Synthetic Minority Oversampling Technique (SMOTE) was applied to balance the dataset. Model evaluation highlighted the superiority of random forest models employing 10 and 20 decision trees, alongside KNN, which demonstrated consistent high accuracy, balanced precision-recall trade-offs, and notable F1 scores in crime status prediction.

Keywords:
Ensemble learning Computer science Machine learning Artificial intelligence Psychology

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
6
Refs
0.06
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Customer churn and segmentation
Social Sciences →  Business, Management and Accounting →  Marketing
© 2026 ScienceGate Book Chapters — All rights reserved.