Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.14 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)
Even basic procedures of a financial apparatus like fraud detection, credit scoring, and risk modelling utilizing machine learning (ML) processes are extremely complex, and the versatility of data infrastructure is put in the limelight. The Apache Spark platform of big data is now fashionable for executing big ML loads, owing to the fact that processing has been made in-memory and the ecosystem has been extended. Nevertheless, the performance and observability of every component—which in most cases is a bottleneck—is the Spark History Server (SHS), which is incidentally the component involved in post-execution diagnosis and monitoring. High-frequency financial settings may lead to low SHS performance and difficulty in the speed of troubleshooting, auditability, and understandability of the systems.
The critical analysis of the ways through which the Spark History Server can be optimized with references to large-scale operations in financial systems connected to ML is devoted to the review. The paper is based on the literature available in the sphere of distributed computing, cloud infrastructure, hybrid data processing pipelines, and real-time analytics, and proposes architectural, operational, and performance-based solutions to SHS improvement. Among the most important ones are log compaction, memory management, streaming integration, and the importance of resilient cloud environments as tools to scale SHS. The review and recommendations of how this can be improved in the future also cover future limitations to implementations of SHS. Financial institutions will have an opportunity to increase the speed of their working activity, address new requirements, and provide the further delivery of ML-based insights to the further improvement of SHS functionality.
Keywords:
Spark History Server, Machine Learning, Financial Systems, Log Optimization
Cite Article:
"Optimizing Spark History Server for Large-Scale ML Workloads in Financial Systems", International Journal for Research Trends and Innovation (www.ijrti.org), ISSN:2456-3315, Vol.11, Issue 2, page no.b208-b213, February-2026, Available :http://www.ijrti.org/papers/IJRTI2602126.pdf
Downloads:
00096
ISSN:
2456-3315 | IMPACT FACTOR: 8.14 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.14 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator