Analyzing Access Logs Data using Stream Based Architecture

Gautam, Nitendra

Analyzing Access Logs Data using Stream Based Architecture

Files

Analyzing Access Logs Data using Stream Based Architecture.pdf (1.16 MB)

Date

2018

Authors

Gautam, Nitendra

Publisher

North Dakota State University

Abstract

Within the past decades, the enterprise-level IT infrastructure in many businesses have grown from a few to thousands of servers, increasing the digital footprints they produce. These digital footprints include access logs that contain information about different events such as activity related to usage patterns, networks and any hostile activity affecting the network. Apache Hadoop has been one of the most standardized frameworks and is used by many Information Technology (IT) companies for analyzing these log files in distributed batch mode using MapReduce programming model. As these access logs include important information related to security and usage patterns, companies are now looking for an architecture that allows analyzing these logs in real time. To overcome the limitations of the MapReduce based architecture of Hadoop, this paper proposes a new and more efficient data processing architecture using Apache Spark, Kafka and other technologies that can handle both real-time and batch-based data.

URI

https://hdl.handle.net/10365/28001

Collections

Computer Science Masters Papers

Full item page

Analyzing Access Logs Data using Stream Based Architecture

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections