Foundational Algorithms Underlying Horizontal Processing of Vertically Structured Big Data Using pTrees

Hossain, Mohammad

dc.contributor.author	Hossain, Mohammad
dc.description.abstract	For Big Data, the time taken to process a data mining algorithm is a critical issue. Many reliable algorithms are unusable in the big data environment due to the fact that the processing takes an unacceptable amount of time. Therefore, increasing the speed of processing is very important. To address the speed issue we use horizontal processing of vertically structured data rather than the ubiquitous vertical (scan) processing of horizontal (record) data. pTree technology represents and processes data differently from the traditional horizontal data technologies. In pTree technology, the data is structured column-wise (into bit slices) and the columns are processed horizontally (typically across a few to a few hundred bit level columns), while in horizontal technologies, data is structured row-wise and those rows are processed vertically. pTrees are lossless, compressed and data-mining ready data structures. pTrees are lossless because the vertical bit-wise partitioning that is used in the pTree technology guarantees that all information is retained completely. There is no loss of information in converting horizontal data to this vertical format. pTrees are data-mining ready because the fast, horizontal data mining processes involved can be done without the need to reconstruct the original form of data. This technique has been exploited in various domains and data mining algorithms, ranging from classification, clustering, association rule mining, as well as other data mining algorithms. In this research work, we evaluate and compare the speeds of various foundational algorithms required for using this pTree technology in many data mining tasks.	en_US
dc.publisher	North Dakota State University	en_US
dc.rights	NDSU Policy 190.6.2
dc.title	Foundational Algorithms Underlying Horizontal Processing of Vertically Structured Big Data Using pTrees	en_US
dc.type	Dissertation	en_US
dc.type	Video	en_US
dc.date.accessioned	2016-04-20T14:53:25Z
dc.date.available	2016-04-20T14:53:25Z
dc.date.issued	2016
dc.identifier.uri	http://hdl.handle.net/10365/25573
dc.rights.uri	https://www.ndsu.edu/fileadmin/policy/190.pdf
ndsu.degree	Doctor of Philosophy (PhD)	en_US
ndsu.college	Engineering	en_US
ndsu.department	Computer Science	en_US
ndsu.program	Computer Science	en_US
ndsu.advisor	Perrizo, William

Files in this item

Name:: Mohammad Hossain video.mov
Size:: 84.42Mb
Format:: QuickTime video

View/Open

Name:: Foundational Algorithms Underlying ...
Size:: 442.8Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Show simple item record