Analyzing Access Logs Data using Stream Based Architecture

dc.contributor.authorGautam, Nitendra
dc.date.accessioned2018-04-21T00:02:20Z
dc.date.available2018-04-21T00:02:20Z
dc.date.issued2018
dc.description.abstractWithin the past decades, the enterprise-level IT infrastructure in many businesses have grown from a few to thousands of servers, increasing the digital footprints they produce. These digital footprints include access logs that contain information about different events such as activity related to usage patterns, networks and any hostile activity affecting the network. Apache Hadoop has been one of the most standardized frameworks and is used by many Information Technology (IT) companies for analyzing these log files in distributed batch mode using MapReduce programming model. As these access logs include important information related to security and usage patterns, companies are now looking for an architecture that allows analyzing these logs in real time. To overcome the limitations of the MapReduce based architecture of Hadoop, this paper proposes a new and more efficient data processing architecture using Apache Spark, Kafka and other technologies that can handle both real-time and batch-based data.en_US
dc.identifier.urihttps://hdl.handle.net/10365/28001
dc.publisherNorth Dakota State Universityen_US
dc.rightsNDSU Policy 190.6.2
dc.rights.urihttps://www.ndsu.edu/fileadmin/policy/190.pdf
dc.subject.lcshSpark (Electronic resource : Apache Software Foundation)
dc.subject.lcshApache (Computer file : Apache Group)
dc.subject.lcshStreaming technology (Telecommunications)en_US
dc.subject.lcshElectronic data mining.en_US
dc.subject.lcshWeb usage mining. .en_US
dc.subject.lcshBig data.en_US
dc.subject.lcshApache Hadoop.en_US
dc.subject.lcshMapReduce (Computer file)en_US
dc.subject.lcshSPARK (Electronic resource)en_US
dc.titleAnalyzing Access Logs Data using Stream Based Architectureen_US
dc.typeMaster's paperen_US
ndsu.advisorDenton, Anne M.
ndsu.collegeEngineeringen_US
ndsu.degreeMaster of Science (MS)en_US
ndsu.departmentComputer Scienceen_US
ndsu.programComputer Scienceen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Analyzing Access Logs Data using Stream Based Architecture.pdf
Size:
1.16 MB
Format:
Adobe Portable Document Format
Description:
Analyzing Access Logs Data using Stream Based Architecture

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.63 KB
Format:
Item-specific license agreed to upon submission
Description: