European Journal of Information Technologies and Computer Science

Test Automation for Serverless Architectures (FaaS)

Pradeepkumar Palanisamy — 2025-07-30

Serverless architectures, and specifically Function-as-a-Service (FaaS), represent a paradigm shift in application development, offering compelling benefits such as automatic scaling, a pay-per-use cost model, and significantly reduced operational management of underlying infrastructure. This architectural style allows developers to focus on writing business logic in ephemeral, event-triggered functions. However, these very characteristics—distributed components, statelessness, reliance on a multitude of managed cloud services, and the event-driven execution model—introduce unique and complex challenges for traditional software testing methodologies. Ensuring the reliability, performance, security, and correctness of these highly decoupled systems necessitates a robust and tailored approach to test automation. This document provides an in-depth exploration of the critical role test automation plays in the serverless ecosystem. It meticulously examines strategies for designing and implementing automated unit, integration, and end-to-end tests specifically for FaaS applications. Key considerations such as effective mocking of event sources and dependent services, managing distributed state for testing, validating complex event-driven workflows, and choosing appropriate tools and frameworks are discussed in detail. Furthermore, the document outlines best practices for constructing resilient and efficient automated testing pipelines that integrate seamlessly with CI/CD processes, enabling agile development and the consistent delivery of high-quality serverless solutions. It underscores that comprehensive test automation is not merely beneficial but an indispensable component for realizing the full potential of serverless architectures.
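As a rough illustration of the unit-testing strategy described above, the following minimal sketch tests a FaaS-style function in isolation by passing it a hand-built event and a mocked dependent service. Everything named here is hypothetical and not taken from the paper: the `make_handler` factory, the `store.put_item` call, and the event shape (a JSON string under `"body"`) are assumptions chosen only to show the pattern.

```python
import json
import unittest
from unittest.mock import Mock


def make_handler(store):
    """Build a FaaS-style handler around an injected storage client (hypothetical)."""
    def handler(event, context):
        body = json.loads(event["body"])
        if "order_id" not in body:
            return {"statusCode": 400,
                    "body": json.dumps({"error": "order_id required"})}
        store.put_item(body)  # the managed service call that the test replaces
        return {"statusCode": 201,
                "body": json.dumps({"saved": body["order_id"]})}
    return handler


class HandlerTest(unittest.TestCase):
    def test_valid_event_is_persisted(self):
        store = Mock()  # stands in for the real cloud storage client
        handler = make_handler(store)
        event = {"body": json.dumps({"order_id": "A1"})}  # simulated trigger event
        response = handler(event, None)
        self.assertEqual(response["statusCode"], 201)
        store.put_item.assert_called_once_with({"order_id": "A1"})

    def test_missing_field_is_rejected(self):
        store = Mock()
        handler = make_handler(store)
        response = handler({"body": json.dumps({})}, None)
        self.assertEqual(response["statusCode"], 400)
        store.put_item.assert_not_called()
```

Saved as a module, the tests can be run with `python -m unittest`. Injecting the storage client keeps the function free of any cloud SDK at test time; integration and end-to-end tests would still be needed to exercise the real event source and service.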

Comparative Analysis of Optimized GCD and Hybrid LLM-GCD Approaches for Retail Shelf Space Allocation

Ravi Teja Pagidoju — 2025-08-12

Solving the retail shelf space allocation problem requires methods that balance speed and solution quality. This paper compares two methods: an optimized dynamic programming algorithm that uses greatest common divisor (GCD) reduction and a hybrid method that combines Large Language Model (LLM) categorization with parallel GCD optimization. Tests on mixed product datasets with 20, 50, and 100 items show a clear trade-off between speed and optimality. The optimized GCD method finds optimal solutions while using the available space, but it takes longer to compute (2.49 ms to 75.74 ms). The hybrid approach is more computationally efficient (1.89 ms to 10.68 ms) thanks to smart product grouping and parallel processing, and it uses 91%–98.9% of the space. The hybrid method gives up 3%–9% of possible profit but cuts computation time by 78%–85%, which makes it suitable for real-time retail applications where speed is critical.
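The abstract does not spell out the algorithm, so the following is only a sketch of one common way GCD reduction can shrink a dynamic programming table. It assumes the problem is modeled as a 0/1 knapsack over integer product widths, which may differ from the paper's formulation. The function name `allocate_shelf` and the sample data are hypothetical.

```python
from functools import reduce
from math import gcd


def allocate_shelf(widths, profits, capacity):
    """0/1 knapsack over shelf width, with all widths scaled down by their GCD.

    Returns (best_profit, indices_of_chosen_products).
    """
    g = reduce(gcd, widths, 0) or 1      # common divisor of all product widths
    scaled = [w // g for w in widths]
    cap = capacity // g                  # leftover space below g can never be filled

    best = [0] * (cap + 1)               # best[c]: max profit within scaled width c
    chosen = [[] for _ in range(cap + 1)]
    for i, (w, p) in enumerate(zip(scaled, profits)):
        # Iterate capacities downward so each product is placed at most once.
        for c in range(cap, w - 1, -1):
            candidate = best[c - w] + p
            if candidate > best[c]:
                best[c] = candidate
                chosen[c] = chosen[c - w] + [i]
    return best[cap], chosen[cap]


# Widths share a GCD of 15, so a shelf of width 100 needs 7 table columns, not 101.
profit, picked = allocate_shelf([30, 45, 60, 15], [40, 50, 90, 10], 100)
```

The reduction is exact because every achievable total width is a multiple of the GCD. It only pays off when the widths share a divisor greater than 1.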

Securing API-Based Integrations in Federated Cloud Architectures: A Zero Trust Perspective

Aditya Ramaswamy — 2025-07-12

This paper examines the implementation of zero trust security principles in API-based integrations within federated cloud architectures, with emphasis on enterprise financial and Human Capital Management (HCM) systems. As organizations increasingly adopt multi-cloud strategies, traditional perimeter-based security models fail to protect sensitive data flows across system boundaries. Drawing from implementation experience with Workday integrations to banking and payroll systems, we present a framework for securing API communications in federated environments. The approach encompasses robust identity verification, context-aware authorization, comprehensive encryption, continuous monitoring, and resilient integration design. Case studies demonstrate practical applications, highlighting security improvements and operational efficiencies. Recommendations for organizations transitioning toward zero trust architectures in their integration landscapes are provided.

Domain-Adaptive Pretraining of Transformer-Based Language Models on Medical Texts: A High-Performance Computing Experiment

Charles Kinyua Gitonga — 2025-04-03

This research investigated the effect of utilizing high-performance computing (HPC) resources to enhance the adaptability and performance of transformer-based language models through intensive domain-specific pretraining in the medical domain. The study aimed to answer the question: Can domain-adaptive pretraining on medical texts significantly improve language model performance metrics such as perplexity while maintaining computational efficiency and addressing ethical considerations? The research utilized a corpus of medical texts, carefully split into training and evaluation datasets. Initial model training on NVIDIA A30 GPUs, with 96% GPU utilization, yielded an average perplexity of 73.54. Following iterative refinements (domain-specific tokenizer optimization, data preprocessing, mixed-precision training, and adjusted learning parameters), the final model achieved an average perplexity of 3.39. The evaluation run processed 7103 samples in 98.02 seconds, with a training loss of 2.405 and an evaluation loss of 2.045, indicating strong generalization and the absence of overfitting. The final model and results were saved for reproducibility and future use. This study was justified by the pressing need for accurate and efficient medical natural language processing (NLP) applications in areas such as clinical decision support, patient record summarization, and medical research analysis. The findings highlight that investing in HPC-driven domain-adaptive pretraining delivers substantial improvements in performance and equips medical NLP models to handle the complexities of domain-specific language effectively. The ethical considerations of this research centered on optimizing GPU utilization to reduce energy consumption and on ensuring transparency through reproducible methodologies. We recommend that future research explore larger medical datasets, broader clinical specializations, and diverse transformer architectures, while also investigating the transferability of learned representations across related medical subdomains. These advancements could further enhance the applicability of specialized language models in medical research and practice.

Exploring RAG Solutions for a Specific Language: Albanian

Leotrim Ramadani — 2025-02-12

The primary goal of this project is to develop a powerful information retrieval and question-answering system specifically tailored for Albanian-speaking users, bridging the gap between traditional document search methods and modern, context-aware responses. This solution aims to address the unique linguistic and document-processing challenges present in Albanian-language data by combining state-of-the-art Retrieval-Augmented Generation (RAG) techniques with advanced natural language processing (NLP) capabilities. Through the implementation of this RAG solution, we aim to empower organizations, educational institutions, and users in Albanian-speaking regions with fast, accurate, and contextually relevant access to information within their documents. By leveraging vector-based search, large language models (LLMs), and optimized document processing adapted to the nuances of the Albanian language, this system will simplify information access, reduce reliance on manual searches, and enhance decision-making processes. RAG is a technique for increasing the accuracy and reliability of generative artificial intelligence models with facts obtained from external resources. This technique fills a gap in the way LLMs work. In other words, LLMs are neural networks, loosely modeled on those of the brain, whose size is usually measured by the number of parameters they contain. In the current digital era, organizations and institutions in Albanian-speaking regions face significant challenges in processing, analyzing, and efficiently retrieving information from their documents. Traditional search methods often fail to understand the contextual nuances of the Albanian language, leading to inefficient information retrieval and suboptimal user experiences. Also, the lack of specialized NLP tools for the Albanian language creates barriers to the effective implementation of document management and question-answering systems.
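To make the retrieval step of a RAG pipeline concrete, here is a deliberately simplified sketch. It is not the system described in the abstract. As an assumption for illustration, plain term-count vectors stand in for learned embeddings, and the final call to a language model is left out. The function names and the three sample sentences are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy 'embedding': lowercase term counts (a stand-in for a real vector model)."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(query, documents, k=1):
    """Assemble the augmented prompt that would be sent to a language model."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Context:\n{context}\n\nQuestion: {query}"


DOCS = [
    "Tirana është kryeqyteti i Shqipërisë.",
    "Prishtina është kryeqyteti i Kosovës.",
    "Gjuha shqipe është gjuhë indoevropiane.",
]
top = retrieve("Cili është kryeqyteti i Kosovës?", DOCS)
```

Exact word matching of this kind does not handle Albanian inflection (for example, different case endings of the same noun), which is one reason a production system would rely on language-aware embeddings and preprocessing instead.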