UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014

Published in:

Volume 10 Issue 12
December-2023
eISSN: 2349-5162

Impact factor: 7.95 (calculated by Google Scholar)

Unique Identifier

Published Paper ID: JETIR2312486
Registration ID: 530297
Page Number: e697-e701

Title

Data Extraction using Optical Character Recognition and Natural Language Processing

Authors

Abstract

Research on data extraction has grown in recent years. In today's digitalized world, documents such as text files, invoices, and bills increasingly exist in digital form. Optical Character Recognition (OCR) is the technology that converts an image of text into a machine-readable text format so that it can be stored as text data. Most businesses still receive information through print media such as invoices and printed contracts, and processing these documents manually is slow and tedious; this project aims to help automate that work. Our aim is to extract all of the data from invoices, including the text inside tables, and to classify it as dictionary-style Key: Value pairs.
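To illustrate the Key: Value output the abstract describes, here is a minimal Python sketch. It is not the authors' method: the paper's keywords point to OCR, YOLO, and NER, whereas this example assumes the OCR step has already produced plain text and uses a simple regular expression as a stand-in for the classification step. The function name `extract_key_values` and the sample invoice text are hypothetical.

```python
import re

# Matches lines of the form "Key: Value"; the key is everything before the
# first colon, the value is everything after it (both whitespace-trimmed).
KEY_VALUE_LINE = re.compile(r"^\s*([^:]+?)\s*:\s*(.+?)\s*$")


def extract_key_values(ocr_text: str) -> dict:
    """Collect "Key: Value" lines from OCR output into a dictionary.

    Lines without a colon (titles, footers, free text) are skipped.
    """
    pairs = {}
    for line in ocr_text.splitlines():
        match = KEY_VALUE_LINE.match(line)
        if match:
            key, value = match.groups()
            pairs[key] = value
    return pairs


# Hypothetical text, standing in for what an OCR engine might return
# for a scanned invoice.
SAMPLE_OCR_TEXT = """INVOICE
Invoice No: INV-001
Date: 2023-12-01
Total : 450.00
Thank you for your business
"""

if __name__ == "__main__":
    for key, value in extract_key_values(SAMPLE_OCR_TEXT).items():
        print(f"{key!r}: {value!r}")
```

A rule-based pass like this only handles fields that are already printed as labelled lines; table contents and unlabelled entities would need the layout detection and NER components the paper refers to.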

Key Words

Deep learning, OCR, Named Entity Recognition (NER), Natural Language Processing (NLP), YOLO, Data extraction

Cite This Article

"Data Extraction using Optical Character Recognition and Natural Language Processing", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN: 2349-5162, Vol. 10, Issue 12, page no. e697-e701, December-2023, Available: http://www.jetir.org/papers/JETIR2312486.pdf

Publication Details

Country: New Delhi, Delhi, India
Area: Engineering
Publisher: IJ Publication

