Efficient Algorithm for Auto Correction Using n-gram Indexing

Please use this identifier to cite or link to this item: http://10.1.7.192:80/jspui/handle/123456789/2654

Title:	Efficient Algorithm for Auto Correction Using n-gram Indexing
Authors:	Lalwani, Mahesh Bagmar, Nitesh Parikh, Saurin
Keywords:	Edit Distance Ngram Trigram String Searching Pattern Matching Computer Faculty Paper Faculty Paper ITFCA001
Issue Date:	Jun-2011
Publisher:	IOAJ Pub.
Series/Report no.:	ITFCA001-2
Abstract:	Auto correction functionality is very popular in search portals. Its principal purpose is to correct common spelling or typing errors, saving time for the user. However, when there are millions of strings in a dictionary, it takes considerable amount of time to find the nearest matching string. Various approaches have been proposed for efficiently implementing auto correction functionality. All of these approaches focus on using suitable data structure and few heuristics to solve the problems. Here, we propose a new idea which eliminates the need for calculating edit distance with each string in the dictionary. It uses the concept of Ngram based indexing and hashing to filter out irrelevant strings from dictionary. Experiments suggest that proposed algorithm provides both efficient and accurate results.
Description:	International Journal of Computer & Communication Technology, Vol. 2 (7) Jun, 2011, Page No. 23-27
URI:	http://hdl.handle.net/123456789/2654
Appears in Collections:	Faculty Papers, CE

Files in This Item:

File	Description	Size	Format
ITFCA001-2.pdf	ITFCA001-2	106.76 kB	Adobe PDF	View/Open

IR @ Nirma University