Pix2Text

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowerin...

Results 59 Pix2Text issues

Source code:

```
from pix2text import Pix2Text

img_fp = 'C:/Users/PC/AppData/Local/Programs/Python/Python38/Envs/Pytorch/formula.jpg'
p2t = Pix2Text()
out_text = p2t(img_fp)  # `p2t.recognize(img_fp)` gives the same result
print(out_text)
```

Result: cannot import name 'ConvBNActivation' from 'torchvision.models.mobilenetv2'

As shown in the image: ![image](https://user-images.githubusercontent.com/98604286/198835308-3b4aedac-f3a6-43bd-a5a0-e9f3306bb437.png)...
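One way to narrow down an import error like this is to check whether the installed package still exposes the name before anything tries to import it. The following is a minimal sketch: `has_name` is a hypothetical helper, not part of Pix2Text or torchvision, and the guess that the installed torchvision release simply does not export `ConvBNActivation` is an assumption, not something the issue confirms.

```python
import importlib


def has_name(module_name: str, attr: str) -> bool:
    """Return True if `module_name` imports and exposes `attr` (hypothetical helper)."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # The module itself is missing or cannot be imported.
        return False
    return hasattr(module, attr)
```

Calling `has_name("torchvision.models.mobilenetv2", "ConvBNActivation")` in the affected environment would show whether the name is available there; if it returns `False`, comparing the installed torchvision version against the one the dependency expects is a reasonable next step.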

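Has the author considered adding a parameter to the recognize function that lets the caller choose which recognition model to use? For example, passing type="formula" would force the formula model, while passing "none" would keep the existing automatic detection. As the images below show, for the same formula, a different screenshot crop sometimes makes the result flip between "**general**" and "**formula**". ![QA{ 1`2 JDFSGBC9MUWLS6](https://user-images.githubusercontent.com/86911121/194149681-27b332a0-7118-44f2-89f2-ef03c91afd1e.png) ![9GJFP2%`_6PP%4K61_WV3NI](https://user-images.githubusercontent.com/86911121/194149740-8a27ca78-3ea5-4d6b-94fb-d15f281c7640.png)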
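The requested behaviour can be sketched as a small dispatch wrapper. Everything here is hypothetical: `recognize_with_type`, the `recognizers` mapping, and the `classify` callable are illustrative names, not Pix2Text APIs, and the sketch only assumes that a classifier step and per-type recognizers exist as separate callables.

```python
from typing import Callable, Dict, Optional


def recognize_with_type(
    img_fp: str,
    recognizers: Dict[str, Callable[[str], str]],
    classify: Callable[[str], str],
    image_type: Optional[str] = None,
) -> str:
    """Use the recognizer for `image_type`; auto-detect when it is None or "none"."""
    if image_type is None or image_type == "none":
        # Keep the existing behaviour: let the classifier pick the type.
        image_type = classify(img_fp)
    if image_type not in recognizers:
        raise ValueError(f"unknown image type: {image_type!r}")
    return recognizers[image_type](img_fp)
```

With this shape, a caller who knows the crop contains a formula can pass `image_type="formula"` and bypass the classifier, which would avoid the flip between "general" and "formula" described in the issue.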

I have mostly solved it, but I want to leave a warning for others' convenience: when first using pix2text and automatically downloading models, whether cnocr or cnstd used the url...

YoloV7_Tiny automatic download failed [WARNING 2024-04-07 17:15:53,577 _assert_and_prepare_model_files:214] Can NOT find model file C:\Users\Administrator\AppData\Roaming\cnstd\1.2\analysis\mfd-yolov7_tiny.pt [ERROR 2024-04-07 17:15:53,917 get_model_file:251] Failed downloading url https://sg-models.oss-cn-beijing.aliyuncs.com/cnstd/1.2/yolov7_tiny_mfd-pytorch.zip pix2text=1.0.2.3

```
for _out in outs:
    line_number = _out.get('line_number', 0) + 1
    if len(out_texts)...
```

While browsing GitHub recently I came across https://github.com/Mathpix/mpx-cli. Could this project borrow from it and offer similar functionality?

![0a54af7a250d4388c8dae5f0e2554ff5](https://github.com/breezedeus/Pix2Text/assets/66133175/1fb2135f-5294-4ffe-9c0a-0abd34380a6a) ![微信图片_20240305142003](https://github.com/breezedeus/Pix2Text/assets/66133175/999440fe-f5df-4d97-be8c-06ca783927e7)

Hey everyone, I was benchmarking Pix2Text against Mathpix and found the following patterns for which Pix2Text gives incorrect output while Mathpix gives the correct response. If there are plans for...

Calling recognize(img, use_analyzer=True) returns JSON containing { "code": 200, "message": "OK", "data": [ { "type": "embedding", "text": " $$\\frac{3}{5}$$ ", "position": [ [ 205.0, 11.0 ], [ 223.0, 11.0 ], [ 223.0, 49.0 ],...
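The `position` field in the response above appears to be a list of `[x, y]` corner points. A minimal sketch for reducing such a list to an axis-aligned bounding box follows; `polygon_to_bbox` is a hypothetical helper, and the assumption that every entry is a complete `[x, y]` pair is not confirmed by the truncated snippet.

```python
from typing import Sequence, Tuple


def polygon_to_bbox(
    position: Sequence[Sequence[float]],
) -> Tuple[float, float, float, float]:
    """Return (x_min, y_min, x_max, y_max) for a list of [x, y] points."""
    xs = [point[0] for point in position]
    ys = [point[1] for point in position]
    return min(xs), min(ys), max(xs), max(ys)


# Made-up coordinates for illustration, not taken from the issue.
box = polygon_to_bbox([[0.0, 0.0], [10.0, 0.0], [10.0, 5.0], [0.0, 5.0]])
```

Because it takes the minimum and maximum over all points, the helper does not depend on the order in which the corners are listed.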

Hi, I'm currently developing a PDF parser specialised for math PDFs. The non-OCR solutions offer great accuracy for text because the text is simply extracted, not detected optically. So, is it...