An Approach Paper-Compressed Indexing of Tweets for Information Retrieval

Authors

  • Godhani R PG Student, RCOEM, Nagpur, India
  • Naidu D Assistant Professor, RCOEM, Nagpur, India

Keywords:

Inverted Index, Information Retrieval, Compression, Tweets, Inverted Index Compression, Decompression

Abstract

In this paper, we present an approach to compressed indexing of tweets for information retrieval. We take tweets (short text messages of up to 140 characters), preprocess them, build an index, compress the index, and measure the percentage accuracy of the compressed and uncompressed indexes. The paper also reviews the literature on some of the approaches used to optimize inverted index compression.
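
The steps named in the abstract (preprocess, index, compress, compare) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The abstract does not name a compression scheme or define accuracy, so the choices here are assumptions: gap encoding with variable-byte codes, a tiny stop-word list, regex-based cleanup, and an agreement measure in place of accuracy. All function names are hypothetical.

```python
import re

# Assumption: a tiny illustrative stop-word list, not the one used in the paper.
STOP_WORDS = {"a", "an", "the", "is", "of", "and", "to", "in", "for", "on"}

def preprocess(tweet):
    """Strip retweet markers, @mentions and '#' signs, then drop stop words."""
    text = re.sub(r"\bRT\b", " ", tweet)
    text = re.sub(r"@\w+", " ", text)
    text = text.replace("#", " ")
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]

def build_index(tweets):
    """Map each term to the sorted list of tweet ids that contain it."""
    index = {}
    for doc_id, tweet in enumerate(tweets):
        for term in set(preprocess(tweet)):
            index.setdefault(term, []).append(doc_id)
    return index

def vbyte_encode(numbers):
    """Variable-byte code: 7 data bits per byte, high bit set on the last byte."""
    out = bytearray()
    for n in numbers:
        while n >= 128:
            out.append(n & 0x7F)
            n >>= 7
        out.append(n | 0x80)
    return bytes(out)

def vbyte_decode(data):
    numbers, n, shift = [], 0, 0
    for byte in data:
        if byte & 0x80:  # last byte of the current number
            numbers.append(n | ((byte & 0x7F) << shift))
            n, shift = 0, 0
        else:
            n |= byte << shift
            shift += 7
    return numbers

def compress_postings(postings):
    """Store gaps between consecutive ids instead of the ids themselves."""
    gaps = [postings[0]] + [b - a for a, b in zip(postings, postings[1:])]
    return vbyte_encode(gaps)

def decompress_postings(data):
    postings, total = [], 0
    for gap in vbyte_decode(data):
        total += gap
        postings.append(total)
    return postings

def percent_agreement(index, compressed, queries):
    """Assumed measure: share of query terms with identical results from both indexes."""
    same = 0
    for term in queries:
        plain = index.get(term, [])
        packed = compressed.get(term)
        unpacked = decompress_postings(packed) if packed is not None else []
        same += plain == unpacked
    return 100.0 * same / len(queries)

tweets = [
    "RT @alice: Compression of the #index is fun",
    "Tweets need an index",
    "Searching tweets",
]
index = build_index(tweets)
compressed = {term: compress_postings(ids) for term, ids in index.items()}
```

Calling `percent_agreement(index, compressed, ["index", "tweets", "missing"])` then reports how often the two indexes return the same tweet ids. Because variable-byte coding is lossless, any disagreement in this sketch would point to a bug rather than to a property of the compression.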

Published

2024-02-26

How to Cite

Godhani, R., & Naidu, D. (2024). An Approach Paper-Compressed Indexing of Tweets for Information Retrieval. COMPUSOFT: An International Journal of Advanced Computer Technology, 3(07), 1012–1015. Retrieved from https://ijact.in/index.php/j/article/view/177

Section

Original Research Article