Get Med Data

Extract data from medical records and restore old damaged records!

Installs required:

!sudo apt install tesseract-ocr
!pip install pytesseract
pip install language_tool_python
pip install PyPDF2
pip install reportlab

How the code works:

Step 1: The original image is converted into grayscale and saved as a new image. The new image is then read, and text is extracted using Tesseract OCR. This is the initial step where the first attempt is made to read the text and return the text.

Step 2: Here, there are two possibilities. One that the original image was clean. The other that the original image is a picture of an old record that might have had some physical damage to the paper, resulting in incorrect OCR output. So, to check if there were inaccurate readings, we will check for grammatical errors in the text. A high error rate would indicate issues with the original image.

Step 3: Now that we have our clean text, we can search for the variant name and highlight it on the image.

Step 4: Make a standardized presentation format for all the images to make it easier to understand.

Step 5: Put a final output together with the original image, image with highlights and the standardized form of data in a PDF file.

Output Text and PDF File

You can see the actual output text of the code, saved in the text file. I have also uploaded a PDF that was generated by the script with the standardized form of data, using sample data.

Links

Devpost - https://devpost.com/software/get-med-data

YouTube - https://youtu.be/ZaZfgluj7pE

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
README.md		README.md
exract.py		exract.py
gramCheck.py		gramCheck.py
output.pdf		output.pdf
outputText.txt		outputText.txt
pdfManage.py		pdfManage.py
standard.py		standard.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Get Med Data

Installs required:

How the code works:

Output Text and PDF File

Links

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Get Med Data

Installs required:

How the code works:

Output Text and PDF File

Links

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages