Please use this identifier to cite or link to this item:
http://10.1.7.192:80/jspui/handle/123456789/2654
Title: | Efficient Algorithm for Auto Correction Using n-gram Indexing |
Authors: | Lalwani, Mahesh; Bagmar, Nitesh; Parikh, Saurin |
Keywords: | Edit Distance; Ngram; Trigram; String Searching; Pattern Matching; Computer Faculty Paper; Faculty Paper; ITFCA001 |
Issue Date: | Jun-2011 |
Publisher: | IOAJ Pub. |
Series/Report no.: | ITFCA001-2 |
Abstract: | Auto correction functionality is very popular in search portals. Its principal purpose is to correct common spelling or typing errors, saving time for the user. However, when a dictionary contains millions of strings, finding the nearest matching string takes a considerable amount of time. Various approaches have been proposed for implementing auto correction efficiently. All of these approaches focus on using a suitable data structure and a few heuristics to solve the problem. Here, we propose a new idea that eliminates the need to calculate the edit distance to every string in the dictionary. It uses Ngram-based indexing and hashing to filter irrelevant strings out of the dictionary. Experiments suggest that the proposed algorithm provides both efficient and accurate results. |
Description: | International Journal of Computer & Communication Technology, Vol. 2 (7) Jun, 2011, Page No. 23-27 |
URI: | http://hdl.handle.net/123456789/2654 |
Appears in Collections: | Faculty Papers, CE |
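The abstract describes filtering dictionary strings through an Ngram index so that edit distance is computed only for a small candidate set. This record does not reproduce the paper's algorithm, so the following is only a minimal sketch of that general idea, not the authors' method. The function names (`trigrams`, `build_index`, `edit_distance`, `suggest`), the `$` padding, and the `min_shared` threshold are all assumptions made for illustration.

```python
from collections import defaultdict


def trigrams(word):
    """Return the set of character trigrams of a word, padded with '$' (assumed scheme)."""
    padded = "$$" + word.lower() + "$$"
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


def build_index(dictionary):
    """Hash each trigram to the set of dictionary words that contain it."""
    index = defaultdict(set)
    for word in dictionary:
        for gram in trigrams(word):
            index[gram].add(word)
    return index


def edit_distance(a, b):
    """Levenshtein distance, keeping only the previous row of the table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def suggest(query, index, min_shared=2):
    """Return the nearest word among those sharing >= min_shared trigrams, or None."""
    counts = defaultdict(int)
    for gram in trigrams(query):
        for word in index.get(gram, ()):
            counts[word] += 1
    # Only words that pass the trigram filter are scored by edit distance.
    candidates = [w for w, c in counts.items() if c >= min_shared]
    if not candidates:
        return None
    q = query.lower()
    return min(candidates, key=lambda w: (edit_distance(q, w.lower()), w))


if __name__ == "__main__":
    words = ["algorithm", "logarithm", "altruism", "rhythm"]
    idx = build_index(words)
    print(suggest("algoritm", idx))
```

In this sketch the index lookup touches only words that share a trigram with the query, so the cost of the edit distance step depends on the size of the candidate set rather than the size of the dictionary. The threshold trades recall for speed: a higher `min_shared` prunes more candidates but may drop the correct word for short or heavily misspelled queries.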
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
ITFCA001-2.pdf | ITFCA001-2 | 106.76 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.