An Approach Paper-Compressed Indexing of Tweets for Information Retrieval
Keywords:
Inverted Index, Information Retrieval, Compression, Tweets, Inverted Index Compression, Decompression
Abstract
In this paper, we present an approach to compressed indexing of tweets for information retrieval. We take tweets (short text messages of up to 140 characters), preprocess them, build an inverted index, compress the index, and compare the percentage accuracy of the compressed index and the uncompressed index. The paper also reviews the literature on some of the approaches used to optimize inverted index compression.
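The pipeline described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the authors' implementation: the stop-word list, the function names, the choice to keep the hashtag word while dropping the `#` symbol, and above all the compression scheme. The abstract does not name a scheme, so the sketch uses gap (delta) encoding of tweet ids followed by variable-byte coding, a common textbook technique for inverted index postings.

```python
import re
from collections import defaultdict

# Hypothetical, deliberately tiny stop-word list (not taken from the paper).
STOP_WORDS = {"a", "an", "the", "is", "in", "of", "to", "and", "for", "on"}

def preprocess(tweet):
    """Drop the RT marker, @mentions, '#' symbols and stop words."""
    text = re.sub(r"\bRT\b", " ", tweet)   # retweet marker
    text = re.sub(r"@\w+", " ", text)      # user mentions
    text = text.replace("#", " ")          # assumption: keep the hashtag word
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]

def build_index(tweets):
    """Map each term to an ascending list of tweet ids (list positions)."""
    index = defaultdict(list)
    for doc_id, tweet in enumerate(tweets):
        for term in set(preprocess(tweet)):
            index[term].append(doc_id)
    return dict(index)

def vbyte_encode(numbers):
    """Variable-byte code: 7 data bits per byte, high bit set on the last byte."""
    out = bytearray()
    for n in numbers:
        while n >= 128:
            out.append(n & 0x7F)
            n >>= 7
        out.append(n | 0x80)
    return bytes(out)

def vbyte_decode(data):
    """Inverse of vbyte_encode."""
    numbers, n, shift = [], 0, 0
    for byte in data:
        if byte & 0x80:                    # last byte of the current number
            numbers.append(n | ((byte & 0x7F) << shift))
            n, shift = 0, 0
        else:
            n |= byte << shift
            shift += 7
    return numbers

def compress_postings(postings):
    """Store the first id, then the gaps between consecutive ids."""
    if not postings:
        return b""
    gaps = [postings[0]] + [b - a for a, b in zip(postings, postings[1:])]
    return vbyte_encode(gaps)

def decompress_postings(data):
    """Rebuild tweet ids by summing the decoded gaps."""
    postings, total = [], 0
    for gap in vbyte_decode(data):
        total += gap
        postings.append(total)
    return postings

tweets = [
    "RT @alice: Compression of the inverted index #IR",
    "Searching tweets in a compressed index",
    "@bob inverted index for tweets",
]
index = build_index(tweets)
compressed = {term: compress_postings(ids) for term, ids in index.items()}

# A lookup in the compressed index should match the uncompressed one.
assert decompress_postings(compressed["inverted"]) == index["inverted"]
```

Because this particular encoding is lossless, decoding a compressed postings list returns exactly the tweet ids stored in the uncompressed index. How the paper itself measures percentage accuracy is not specified in the abstract, so the sketch only checks this round trip.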
[Figure: processing pipeline. Collection of data (tweets); removal of stop words and stop symbols; removal of # tag, @, RT; Stanford POS Tagger (tag words as adjective, noun, noun-phrases); build inverted index; compression of inverted index; searching in compressed index and uncompressed index; comparing % accuracy of compressed index and uncompressed index.]
References
Al-Bahadili, Al-Saab. “Compressed Index Query web Search Engine Model”, International Journal of Computer Information Systems (IJCIS), Vol. 1, No. 4, 73-79.
Diego Arroyuelo, Senén Gonzalez. “Document Identifier Reassignment and Run-Length Compressed Inverted Indexes for Improved Search performance”, SIGIR’13, Copyright 2013 ACM.
“VSEncoding: Efficient Coding and Decoding of Integer Lists Via Dynamic Programming”, SIGIR’10, Copyright ACM.
Naiyong Ao, Fan Zhang, Douglas S. Stones. “Efficient Parallel Lists and List Intersection and Index Compression Using Graphics Processing Units”, Proceedings of VLDB Endowment, Vol. 4, Copyright 2011 VLDB Endowment.
Jimmy Lin, Twitter. “Full-Text Indexing for Optimizing Selection operations in Large-Scale data Analytics”, ACM.
Chun Chen, Feng Li. “TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets”, SIGMOD’11, Copyright 2011 ACM.
Akshi Kumar, Teeja Mary Sebastian. “Sentiment analysis on twitter”, IJCSI International Journal of Computer Science Issues, Vol. 9, Issue, No. 3, July 2012.
License
Copyright (c) 2014 COMPUSOFT: An International Journal of Advanced Computer Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.