
ai25395/FMatPix

Fork: 13 Star: 269 (updated 2024-12-20 19:30:49)

License: none yet

Language: Python

A free, portable formula OCR tool supporting LaTeX and MathML

Latest release: v1.7 (2024-11-05 21:24:09)


FMatPix - A Free-Fast-NoDep-Offline Formula Ocr Tool

A free, portable formula OCR tool supporting LaTeX and MathML, based on Texify.
You can run it on your own PC using only the CPU, with no Internet connection required.

What's the advantage of this?

It's free, portable, offline, fast, and accurate.
So don't hesitate to share it with friends who suffer from paid, user-unfriendly, slow OCR tools.

How to get the .exe?

Choose a version and download FMatPix.exe from the release page: https://github.com/ai25395/FMatPix/releases
Note that the release notes also provide a Xunlei cloud-drive link.

Don't forget to give me a star; it's very important to me!

Version features

Page Ocr


Batch Ocr & History


How to use the software?

1. Press Alt+Q to snip (the same shortcut works in single, batch, and page mode); recognition then starts, except in batch mode (see step 5).
2. Click 'Copy Latex' or 'Copy MathMl' to copy the recognition result.
3. Double-click 'Copy Latex' or 'Copy MathMl' to enable automatic copying of the result.
4. 'History' shows the previously snipped formula images. Click an image to copy its result.
5. Double-click 'Batch Ocr' to enable batch-ocr mode. In this mode, recognition does not start after you capture an image. Clicking 'Batch Ocr' then recognizes all the captured images at once.
6. Double-click 'Page Ocr' to enable page-ocr mode. In this mode, after you capture a large image, every formula in it is marked with a box, and you can copy a formula by clicking its box.
You can watch this video for some usage details: https://www.bilibili.com/video/BV137SdYCEd6
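
The deferred behaviour of batch-ocr mode in step 5 can be pictured with a small sketch. This is only an illustration of the idea, not FMatPix's actual code: the `BatchQueue` class and the `fake_recognize` function are hypothetical names, and the recognizer is a placeholder rather than the Texify model.

```python
from typing import Callable, List


class BatchQueue:
    """Hypothetical model of batch-ocr mode: captures wait until flushed."""

    def __init__(self, recognize: Callable[[bytes], str]) -> None:
        self._recognize = recognize      # stand-in for the real OCR call
        self._pending: List[bytes] = []  # captured images not yet recognized

    def capture(self, image: bytes) -> None:
        # In batch mode a snip is only stored; recognition is deferred.
        self._pending.append(image)

    def run(self) -> List[str]:
        # Corresponds to clicking 'Batch Ocr': recognize every pending
        # capture in one pass, then empty the queue.
        results = [self._recognize(img) for img in self._pending]
        self._pending.clear()
        return results


def fake_recognize(image: bytes) -> str:
    # Placeholder recognizer for illustration only (assumption).
    return "formula-%d" % len(image)
```

With this sketch, two calls to `capture` followed by one call to `run` return two results, and a second `run` returns an empty list because the queue has been cleared.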

How to use the code?

1. Set up your Python environment according to requirements.txt.
2. Create a folder named "models", then download models_texify.zip from the v1.7 release page and unzip it into "models".
3. Create a folder named "texmml", then download texmml.zip from the v1.2 release page and unzip it into "texmml".
4. Run ocr_gui.py.
I've tried my best to simplify the code and project, so it's easy to modify the GUI and integrate other OCR models.
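
Before running ocr_gui.py, it may help to confirm that the two folders from steps 2 and 3 are in place. The following is a minimal sketch, assuming both folders sit in the project root and that an empty folder means the zip was not extracted; `missing_assets` is a hypothetical helper, not part of the FMatPix code.

```python
from pathlib import Path
from typing import List


def missing_assets(project_root: Path) -> List[str]:
    """Return the names of required folders that are absent or empty."""
    required = ["models", "texmml"]  # folder names from steps 2 and 3
    missing = []
    for name in required:
        folder = project_root / name
        # A folder counts as missing if it does not exist or has no entries.
        if not folder.is_dir() or not any(folder.iterdir()):
            missing.append(name)
    return missing


if __name__ == "__main__":
    problems = missing_assets(Path.cwd())
    if problems:
        print("Missing or empty folders:", ", ".join(problems))
    else:
        print("Assets found; run: python ocr_gui.py")
```

The check only looks at whether the folders exist and contain something; it does not verify the contents of the archives.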

Potential issues

1. Snipping a very large image takes a long time to recognize.
2. The MathML output may contain some format errors because it is transformed from LaTeX. I've tried to solve most of them. I suggest copying the LaTeX into MathType directly.
3. Sometimes there are formatting issues with curly braces.
4. Some letters or symbols cannot be recognized because they are absent from, or rare in, the token vocabulary or training set.
5. Pasting a very long formula into MathType makes it report an error, so please avoid that.
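
For the curly-brace issue, one way to catch a broken result before pasting it elsewhere is to check that the braces in the copied LaTeX are balanced. This is a sketch of a generic check, not something FMatPix is stated to do; `braces_balanced` is a hypothetical helper, and it ignores escaped braces such as `\{` and `\}`.

```python
def braces_balanced(latex: str) -> bool:
    """Return True if unescaped { and } in a LaTeX string are balanced."""
    depth = 0
    i = 0
    while i < len(latex):
        ch = latex[i]
        if ch == "\\":
            # Skip the backslash and the character it escapes,
            # so \{ and \} are not counted as grouping braces.
            i += 2
            continue
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth < 0:
                return False  # a closing brace appeared before its opener
        i += 1
    return depth == 0
```

A balanced result does not guarantee that the formula is correct; the check only flags the obvious case of a missing or extra brace.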

If you find any other issues, contact me. I'll keep updating this if necessary.

Frequently Asked Questions

1. Support for Mac
I want to develop and test on a Mac first, but there isn't even one I can borrow, so I'll try a cloud Mac instead.
2. Custom shortcut for snipping
Since v1.4 the snipping shortcut has been Alt+Q, which should satisfy most people. I probably won't add a custom-shortcut option, because it would require a config file, turning FMatPix.exe into FMatPix.exe + xxx.xml. In other words, there would be an additional file.
Forgive my OCD, please.
3. Support for dual monitors
Starting with v1.5, FMatPix supports dual monitors.

A cup of coffee for the author

It is not very easy to develop the project, so if it is helpful to you, please consider offering me a cup of coffee🥤.

Recent releases (data updated 2024-12-04 05:00:31):

2024-11-05 21:24:09 v1.7

2024-10-31 22:42:32 v1.6

2024-10-30 16:29:12 v1.5-alpha

2024-10-29 10:29:13 v1.4

2024-10-25 19:44:09 v1.3

2024-10-25 14:22:02 v1.2

2024-10-23 16:39:18 v1.1

2024-10-20 16:51:29 v1.0
