Pix2Text issues

Why can't the declaration of ConvBNActivation be found?

1

Source code:

```
from pix2text import Pix2Text

img_fp = 'C:/Users/PC/AppData/Local/Programs/Python/Python38/Envs/Pytorch/formula.jpg'
p2t = Pix2Text()
out_text = p2t(img_fp)  # `p2t.recognize(img_fp)` gives the same result
print(out_text)
```

Output: cannot import name 'ConvBNActivation' from 'torchvision.models.mobilenetv2'

As shown in the screenshot: ![image](https://user-images.githubusercontent.com/98604286/198835308-3b4aedac-f3a6-43bd-a5a0-e9f3306bb437.png)...
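The error message suggests that the installed torchvision does not expose `ConvBNActivation` from `torchvision.models.mobilenetv2`, which can happen when a dependency was written against a different torchvision release. A minimal diagnostic sketch using only the standard library; the helper name `has_symbol` is hypothetical and is not part of Pix2Text or torchvision:

```python
import importlib


def has_symbol(module_name: str, symbol: str) -> bool:
    """Return True if `module_name` can be imported and exposes `symbol`."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # The module itself is missing or failed to import.
        return False
    return hasattr(module, symbol)


if __name__ == "__main__":
    # Check the exact name from the error message in this issue.
    found = has_symbol("torchvision.models.mobilenetv2", "ConvBNActivation")
    print("ConvBNActivation available:", found)
```

If this prints `False` while torchvision is installed, the installed torchvision and the package importing that name are probably out of step. Which versions match is not stated in the issue, so treat a version change as something to verify rather than a confirmed fix.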

zongpingding

About the recognize function

2

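Has the author considered adding a parameter to the recognize function for selecting the recognition model? For example, type="formula" would force the formula model, while "none" would keep the current automatic detection. As the screenshots below show, for the same formula, a different screenshot selection area sometimes makes the result flip between "**general**" and "**formula**". ![QA{ 1`2 JDFSGBC9MUWLS6](https://user-images.githubusercontent.com/86911121/194149681-27b332a0-7118-44f2-89f2-ef03c91afd1e.png) ![9GJFP2%`_6PP%4K61_WV3NI](https://user-images.githubusercontent.com/86911121/194149740-8a27ca78-3ea5-4d6b-94fb-d15f281c7640.png)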
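The request above could be sketched as a thin dispatcher. This is only an illustration of the proposed behaviour, not Pix2Text's API: the function `recognize_with_type`, the `recognizers` mapping, the `classify` callable, and the parameter name `image_type` are all assumptions made for the example.

```python
from typing import Any, Callable, Dict


def recognize_with_type(
    img: Any,
    recognizers: Dict[str, Callable[[Any], str]],
    classify: Callable[[Any], str],
    image_type: str = "none",
) -> Dict[str, str]:
    """Recognize `img`, optionally forcing a specific recognizer.

    recognizers: maps a type name (e.g. "general", "formula") to a callable.
    classify: guesses the type name for an image (the automatic path).
    image_type: "none" keeps automatic classification; any other value
        must be a key of `recognizers`.
    """
    if image_type == "none":
        image_type = classify(img)
    if image_type not in recognizers:
        raise ValueError(f"unknown image type: {image_type!r}")
    return {"image_type": image_type, "text": recognizers[image_type](img)}
```

With this shape, passing `image_type="formula"` bypasses the classifier entirely, so the crop of the screenshot no longer influences which model is used.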

Alune233

failed to download the models through cn oss

3

I have mostly solved it, but I want to leave a warning for others' convenience: when first using pix2text and automatically downloading models, whether cnocr or cnstd used the url...

vandermondeDET

YoloV7_Tiny automatic download fails

1

YoloV7_Tiny automatic download fails.

[WARNING 2024-04-07 17:15:53,577 _assert_and_prepare_model_files:214] Can NOT find model file C:\Users\Administrator\AppData\Roaming\cnstd\1.2\analysis\mfd-yolov7_tiny.pt

[ERROR 2024-04-07 17:15:53,917 get_model_file:251] Failed downloading url https://sg-models.oss-cn-beijing.aliyuncs.com/cnstd/1.2/yolov7_tiny_mfd-pytorch.zip

pix2text=1.0.2.3
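When the automatic download fails, it can help to confirm which expected model files are actually absent before retrying or placing them by hand. A small sketch using only the standard library; `missing_model_files` is a hypothetical helper, and the path is copied from the log in this issue:

```python
from pathlib import Path
from typing import Iterable, List


def missing_model_files(paths: Iterable[str]) -> List[str]:
    """Return the subset of `paths` that do not exist as regular files."""
    return [str(p) for p in paths if not Path(p).is_file()]


if __name__ == "__main__":
    # Path taken verbatim from the WARNING line in the log.
    expected = [
        r"C:\Users\Administrator\AppData\Roaming\cnstd\1.2\analysis\mfd-yolov7_tiny.pt",
    ]
    for path in missing_model_files(expected):
        print("missing:", path)
```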

xiehurricane

Index out of bounds when merging text

1

```
for _out in outs:
    line_number = _out.get('line_number', 0) + 1
    if len(out_texts)
```
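The snippet in this issue is cut off, so the actual bug cannot be read from it. As a general illustration of how grouping by line number can avoid an `IndexError`, here is a minimal sketch. It assumes each item is a dict with a 0-based `line_number` and a `text` key; the `text` key and the function name `group_texts_by_line` are assumptions, and this is not the library's merge code.

```python
from typing import Dict, List


def group_texts_by_line(outs: List[Dict]) -> List[List[str]]:
    """Group text fragments by 'line_number', growing the list as needed."""
    lines: List[List[str]] = []
    for out in outs:
        line_number = out.get("line_number", 0)
        # Append empty lines until the index exists, so that gaps in the
        # line numbers cannot cause an out-of-range access.
        while len(lines) <= line_number:
            lines.append([])
        lines[line_number].append(out.get("text", ""))
    return lines
```

The `while` loop, rather than a single `if` with one `append`, is what covers the case where line numbers skip ahead by more than one.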

tonyzdev

Could a document recognition feature be added?

1

While browsing GitHub recently I came across https://github.com/Mathpix/mpx-cli. I wonder whether this project could draw on it and offer similar functionality.

zsy1207

Fractions and curly braces are sometimes not detected. Are my parameters wrong? Below are the results from the hf-demo test

4

![0a54af7a250d4388c8dae5f0e2554ff5](https://github.com/breezedeus/Pix2Text/assets/66133175/1fb2135f-5294-4ffe-9c0a-0abd34380a6a) ![微信图片_20240305142003](https://github.com/breezedeus/Pix2Text/assets/66133175/999440fe-f5df-4d97-be8c-06ca783927e7)

openforward

Patterns of incorrect response

1

Hey everyone, I was benchmarking Pix2Text against Mathpix and found the following patterns for which Pix2Text gives incorrect output while Mathpix gives the correct response. If there are plans for...

ishan00

Deploying with Docker on python:3.10.13 returns <unused3>

2

Calling recognize(img, use_analyzer=True) returns JSON that contains { "code": 200, "message": "OK", "data": [ { "type": "embedding", "text": " $$\\frac{3}{5}$$ ", "position": [ [ 205.0, 11.0 ], [ 223.0, 11.0 ], [ 223.0, 49.0 ],...

FocusHuang

Idea: merge digital extraction result and OCR result.

5

Hi, I'm currently developing a pdf parser specialised for math pdf. The non-OCR solutions offer great accuracy for text because they are simply extracted, not detected optically. So, is it...

Shadow-Alex
