Show simple item record

dc.contributor.advisorQureshi, Faisal
dc.contributor.authorBaeenh, Mohmmed
dc.date.accessioned2020-02-27T14:30:26Z
dc.date.accessioned2022-03-29T17:25:50Z
dc.date.available2020-02-27T14:30:26Z
dc.date.available2022-03-29T17:25:50Z
dc.date.issued2020-01-01
dc.identifier.urihttps://hdl.handle.net/10155/1132
dc.description.abstractWe propose a computational attention approach to localize and classify characters in a sequence in a given image. Our approach combines spatial soft-attention with attention regularization and learns “where-to-look” to carry out the sequence classification task. The image is first passed through a Convolutional Neural Network (CNN) that serves as feature extractor. Then at each Recurrent Neural Network (RNN) time step, the attention mechanism attends to the relevant features sequentially to make predictions. The attention mechanism also includes a start and stop state, which instructs the mechanism to start looking and guides it when to stop (e.g., when the sequence has been exhausted). We demonstrate our approach on two sequence detection tasks—multi-digit classification and CAPTCHA unlocking—using the publicly available Street View House Numbers (SVHN) dataset and a custom CAPTCHA dataset. The experiments confirm our hypothesis that the network learns to attend to relevant features by minimizing the loss between the ground truth attention masks and the predicted attention masks.en
dc.description.sponsorshipUniversity of Ontario Institute of Technologyen
dc.language.isoenen
dc.subjectComputational Attentionen
dc.subjectConvolutional Neural Networksen
dc.subjectRecurrent Neural Networksen
dc.subjectMulti-Digit Classificationen
dc.subjectCAPTCHAen
dc.titleMulti-character prediction using attentionen
dc.typeThesisen
dc.degree.levelMaster of Science (MSc)en
dc.degree.disciplineComputer Scienceen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record