The Embed DocScanner takes a user-supplied image of a paper or document and converts it to a PDF file that you can use later. It can also extract the text from that image.
Libraries: Pillow, OpenCV, Tesseract, NumPy, imutils, scikit-image
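
The project's actual code is not shown here, so the following is only a minimal sketch of the two outputs described above (a PDF and the extracted text), under a few assumptions: that Tesseract is reached through the pytesseract wrapper, that the Tesseract binary is installed, and that the input is a single image file. The function names and `default_pdf_path` are hypothetical, and the document detection and perspective correction that OpenCV, imutils, and scikit-image would typically handle are left out.

```python
from pathlib import Path


def default_pdf_path(image_path):
    """Hypothetical helper: same location and name as the image, with a .pdf extension."""
    return Path(image_path).with_suffix(".pdf")


def image_to_pdf(image_path, pdf_path=None):
    """Save a single image as a one-page PDF using Pillow."""
    # Imported lazily so this module still loads when Pillow is not installed.
    from PIL import Image

    pdf_path = Path(pdf_path) if pdf_path else default_pdf_path(image_path)
    with Image.open(image_path) as img:
        # Convert to RGB, since Pillow's PDF writer does not accept every image mode.
        img.convert("RGB").save(pdf_path, "PDF")
    return pdf_path


def extract_text(image_path):
    """Run OCR on the image (assumes pytesseract and the Tesseract binary are available)."""
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as img:
        return pytesseract.image_to_string(img)
```

With these definitions, `image_to_pdf("scan.jpg")` would write `scan.pdf` next to the input, and `extract_text("scan.jpg")` would return the recognized text as a string. OCR quality depends heavily on how clean and well-aligned the image is, which is why a real scanner would normally crop and straighten the page first.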