|
|