Optimizing AI Model Inference on Serverless Cloud Platforms: A Scalable Approach

Journal Title: International Journal of Current Science Research and Review - Year 2025, Vol 8, Issue 05

Abstract

The increasing prevalence of Artificial Intelligence (AI) and Machine Learning (ML) models across various industries has highlighted the critical need for efficient and scalable deployment strategies. Traditional deployment methods often struggle with adapting to fluctuating demands and maintaining cost-effectiveness. Serverless computing has emerged as a promising solution to address these challenges. This paper investigates the deployment of AI models within serverless architectures on Amazon Web Services (AWS), specifically focusing on AWS Lambda and Knative. The study analyzes the limitations of conventional deployment approaches and proposes innovative strategies leveraging the capabilities of serverless technologies. Furthermore, it presents a rigorous evaluation of the performance characteristics of these serverless deployment strategies, discusses crucial security and privacy considerations, incorporates illustrative real-world case studies, and outlines potential future research directions.

Authors and Affiliations

Prudhvi Naayini, Chiranjeevi Bura,

Keywords

Related Articles

The Influence of Industrial Regulations, Strategic Alliances, and Market Competition on Perceived of Firm Performance (Case Study of a Private Construction Sector Companies in Balikpapan)

Providing infrastructure is one of the fundamental sectors that must be realized in the context of economic development so that general prosperity can be achieved. In its implementation, the Government has determined pro...

The Epitome of Ethnic Integration: The Formation and Development of Hui Nationality

In this paper, the origin of the Hui people in China is researched, the formation history of the Hui people is stated in detail, the development and evolution of the Hui people are described in detail, and the origin, fo...

Cultural Differences in Tourist Behavior: A Cross-Cultural Psychological Study

Tourism is a global phenomenon that bridges cultural divides, yet it is also shaped profoundly by the diverse cultural identities of those who travel. This study examines the psychological and behavioral differences in t...

The Influence of Service Quality and Price on KRL Commuter Line Route Solo - Jogja to Customer Loyalty

Transportation as an important element in human life, plays a role in facilitating daily activities and supporting the economy. But because of the heavy traffic caused by so many motorized vehicles, people are looking fo...

Exogenous Cellulase Enzyme Supplementation in Complete Feed Based On Fermentation of Banana Stems for Nutritional Consumption of Beef Cattle

The obstacle to increasing beef cattle production in East Nusa Tenggara, especially Timor Island, is that feed by farmers is still below the dry matter requirements for beef cattle. This fact shows that it is necessary t...

Download PDF file
  • EP ID EP765351
  • DOI 10.47191/ijcsrr/V8-i5-02
  • Views 16
  • Downloads 0

How To Cite

Prudhvi Naayini, Chiranjeevi Bura, (2025). Optimizing AI Model Inference on Serverless Cloud Platforms: A Scalable Approach. International Journal of Current Science Research and Review, 8(05), -. https://europub.co.uk/articles/-A-765351