Show simple item record

Word Spotting in Bitmapped Fax Documents

dc.contributor.authorWilliams, William J.en_US
dc.contributor.authorZalubas, Eugene J.en_US
dc.contributor.authorHero, Alfred O. IIIen_US
dc.date.accessioned2006-09-11T17:14:40Z
dc.date.available2006-09-11T17:14:40Z
dc.date.issued2000-05en_US
dc.identifier.citationWilliams, William J.; Zalubas, Eugene J.; Hero, Alfred O.; (2000). "Word Spotting in Bitmapped Fax Documents." Information Retrieval 2 (2-3): 207-226. <http://hdl.handle.net/2027.42/45988>en_US
dc.identifier.issn1386-4564en_US
dc.identifier.issn1573-7659en_US
dc.identifier.urihttps://hdl.handle.net/2027.42/45988
dc.description.abstractImages and signals may be represented by forms invariant to time shifts, spatial shifts, frequency shifts, and scale changes. Advances in time-frequency analysis and scale transform techniques have made this possible. However, factors such as noise contamination and “style” differences complicate this. An example is found in text, where letters and words may vary in size and position. Examples of complicating variations include the font used, corruption during facsimile (fax) transmission, and printer characteristics. The solution advanced in this paper is to cast the desired invariants into separate subspaces for each extraneous factor or group of factors. The first goal is to have minimal overlap between these subspaces and the second goal is to be able to identify each subspace accurately. Concepts borrowed from high-resolution spectral analysis, but adapted uniquely to this problem have been found to be useful in this context. Once the pertinent subspace is identified, the recognition of a particular invariant form within this subspace is relatively simple using well-known singular value decomposition (SVD) techniques. The basic elements of the approach can be applied to a variety of pattern recognition problems. The specific application covered in this paper is word spotting in bitmapped fax documents.en_US
dc.format.extent273597 bytes
dc.format.extent3115 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypetext/plain
dc.language.isoen_US
dc.publisherKluwer Academic Publishers; Springer Science+Business Mediaen_US
dc.subject.otherComputer Scienceen_US
dc.subject.otherData Structures, Cryptology and Information Theoryen_US
dc.subject.otherManagement of Computing and Information Systemsen_US
dc.subject.otherWord Spottingen_US
dc.subject.otherFacsimileen_US
dc.subject.otherScaleen_US
dc.subject.otherPositionen_US
dc.subject.otherInvarianten_US
dc.titleWord Spotting in Bitmapped Fax Documentsen_US
dc.typeArticleen_US
dc.subject.hlbsecondlevelPhilosophyen_US
dc.subject.hlbsecondlevelComputer Scienceen_US
dc.subject.hlbtoplevelHumanitiesen_US
dc.subject.hlbtoplevelEngineeringen_US
dc.description.peerreviewedPeer Revieweden_US
dc.contributor.affiliationumElectrical Engineering and Computer Science Dept., University of Michigan, Ann Arbor, MI, 48109, USAen_US
dc.contributor.affiliationumElectrical Engineering and Computer Science Dept., University of Michigan, Ann Arbor, MI, 48109, USAen_US
dc.contributor.affiliationumElectrical Engineering and Computer Science Dept., University of Michigan, Ann Arbor, MI, 48109, USAen_US
dc.contributor.affiliationumcampusAnn Arboren_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/45988/1/10791_2004_Article_260693.pdfen_US
dc.identifier.doihttp://dx.doi.org/10.1023/A:1009958827317en_US
dc.identifier.sourceInformation Retrievalen_US
dc.owningcollnameInterdisciplinary and Peer-Reviewed


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.