Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/2654
Title: Efficient Algorithm for Auto Correction Using n-gram Indexing
Authors: Lalwani, Mahesh; Bagmar, Nitesh; Parikh, Saurin
Keywords: Edit Distance; Ngram; Trigram; String Searching; Pattern Matching; Computer Faculty Paper; Faculty Paper; ITFCA001
Issue Date: Jun-2011
Publisher: IOAJ Pub.
Series/Report no.: ITFCA001-2
Abstract: Auto correction functionality is very popular in search portals. Its principal purpose is to correct common spelling or typing errors, saving time for the user. However, when a dictionary contains millions of strings, finding the nearest matching string takes a considerable amount of time. Various approaches have been proposed for implementing auto correction efficiently. All of these approaches rely on a suitable data structure and a few heuristics to solve the problem. Here, we propose a new idea that eliminates the need to calculate the edit distance to each string in the dictionary. It uses n-gram based indexing and hashing to filter irrelevant strings out of the dictionary. Experiments suggest that the proposed algorithm provides both efficient and accurate results.
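Illustrative sketch: the abstract names the general technique (n-gram indexing with hashing to prune the dictionary before any edit distance is computed) but gives no details of the authors' algorithm. The Python below is therefore only a minimal illustration of that general idea, not the method from the paper. The function names, the `$` padding character, the trigram size, and the `min_shared` threshold are all assumptions made for this example.

```python
from collections import defaultdict


def ngrams(word, n=3):
    """Return the set of character n-grams of a word, padded with '$' (assumed padding)."""
    padded = "$" * (n - 1) + word.lower() + "$" * (n - 1)
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def build_index(dictionary, n=3):
    """Hash-based inverted index: n-gram -> set of dictionary words containing it."""
    index = defaultdict(set)
    for word in dictionary:
        for gram in ngrams(word, n):
            index[gram].add(word)
    return index


def edit_distance(a, b):
    """Classic Levenshtein distance using two rows of the DP table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def suggest(query, index, n=3, min_shared=2):
    """Return the closest dictionary word, scoring only words that share
    at least `min_shared` n-grams with the query (hypothetical threshold)."""
    shared = defaultdict(int)
    for gram in ngrams(query, n):
        for word in index.get(gram, ()):
            shared[word] += 1
    candidates = [w for w, count in shared.items() if count >= min_shared]
    if not candidates:
        return None
    # Edit distance is computed only for the filtered candidates.
    return min(candidates, key=lambda w: (edit_distance(query.lower(), w.lower()), w))
```

In this sketch, a query such as "algoritm" is compared only against words that share trigrams with it, so unrelated dictionary entries are never passed to `edit_distance`. Raising `min_shared` prunes more aggressively but risks dropping the correct word for short or heavily misspelled queries.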
Description: International Journal of Computer & Communication Technology, Vol. 2 (7) Jun, 2011, Page No. 23-27
URI: http://hdl.handle.net/123456789/2654
Appears in Collections: Faculty Papers, CE

Files in This Item:
File: ITFCA001-2.pdf
Description: ITFCA001-2
Size: 106.76 kB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.