<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Activity for Tess4J</title><link>https://sourceforge.net/p/tess4j/activity/</link><description>Recent activity for Tess4J</description><language>en</language><lastBuildDate>Thu, 06 Feb 2025 04:56:55 -0000</lastBuildDate><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#43cb</link><description>Yes, it can be. https://github.com/java-native-access/jna/blob/master/www/DirectMapping.md</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 06 Feb 2025 04:56:55 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#43cb</guid></item><item><title>George posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#3024</link><description>thanks, I will try it out. Is direct mapping more efficient than using the interface ?</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">George</dc:creator><pubDate>Wed, 05 Feb 2025 18:05:58 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#3024</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/62eabd2eae/?limit=25#b717</link><description>Leptonica library has many dependencies to open various image file types, such as TIFF, PNG, JPEG, etc., which in turn have other dependencies, as you've seen. On Windows, we were able to embed all the image library dependencies inside libleptonica.dll. We don't know how to generate a similar static library liblept.so on Linux. Installing Tesseract would ensure installing of all the required dependency libraries.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Wed, 05 Feb 2025 05:40:26 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/62eabd2eae/?limit=25#b717</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#561c</link><description>Please use the JNA Direct Mapping API — Leptonica1. https://tess4j.sourceforge.net/docs/index.html</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Wed, 05 Feb 2025 05:26:43 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#561c</guid></item><item><title>George modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</link><description>I encountered ** jdk.internal.org.objectweb.asm.MethodTooLargeException** when I tried to load lept4j 1.16.1 using OpenJDK Runtime Environment Corretto-21.0.6.7.1 (build 21.0.6+7-LTS) I assume this is caused by ASM library used by JDK during invocation of Native.loadLibrary() when the byte code size exceeds the JDK's method size limit (64KB), . A similar issue reported here https://bugs.openjdk.org/browse/JDK-8314528 Is there any work around to load lept4j 1.16.1 on JDK 21 without requiring to create...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">George</dc:creator><pubDate>Fri, 31 Jan 2025 18:22:54 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</guid></item><item><title>George modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</link><description>I encountered ** jdk.internal.org.objectweb.asm.MethodTooLargeException** when I tried to load lept4j 1.16.1 using OpenJDK Runtime Environment Corretto-21.0.6.7.1 (build 21.0.6+7-LTS) I assume this is caused by ASM library used by JDK during invocation of Native.loadLibrary() when the byte code size exceeds the JDK's method size limit (64KB), . A similar issue reported here https://bugs.openjdk.org/browse/JDK-8314528 Is there any work around to load lept4j 1.16.1 on JDK 21 without requiring to create...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">George</dc:creator><pubDate>Fri, 31 Jan 2025 17:46:37 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</guid></item><item><title>George modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</link><description>I encountered ** jdk.internal.org.objectweb.asm.MethodTooLargeException** when I tried to load lept4j 1.16.1 using OpenJDK Runtime Environment Corretto-21.0.6.7.1 (build 21.0.6+7-LTS) I assume this is caused by ASM library used by JDK during invocation of Native.loadLibrary() when the byte code size exceeds the JDK's method size limit (64KB), . A similar issue reported here https://bugs.openjdk.org/browse/JDK-8314528 Is there any work around to load lept4j 1.16.1 on JDK 21 without requiring to create...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">George</dc:creator><pubDate>Fri, 31 Jan 2025 16:42:32 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</guid></item><item><title>George modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</link><description>I encountered ** jdk.internal.org.objectweb.asm.MethodTooLargeException** when I tried to load lept4j 1.16.1 using OpenJDK Runtime Environment Corretto-21.0.6.7.1 (build 21.0.6+7-LTS) I assume this is caused by ASM library used by JDK during invocation of Native.loadLibrary() , and appears to be similar to the issue reported here https://bugs.openjdk.org/browse/JDK-8314528 Is there any work around to load lept4j 1.16.1 on JDK 21 ? On testing tess4j 5.0.0 seems to load without issue on JDK21</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">George</dc:creator><pubDate>Fri, 31 Jan 2025 16:03:55 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</guid></item><item><title>George posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</link><description>I encountered ** jdk.internal.org.objectweb.asm.MethodTooLargeException** when I tried to load tess4j 5.0.0 and lept4j 1.16.1 using OpenJDK Runtime Environment Corretto-21.0.6.7.1 (build 21.0.6+7-LTS) I assume this is caused by ASM library used by JDK during invocation of Native.loadLibrary() , and appears to be similar to the issue reported here https://bugs.openjdk.org/browse/JDK-8314528 Is there any work around to load tess4j 5.0.0 and lept4j 1.16.1 on JDK 21 ?</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">George</dc:creator><pubDate>Fri, 31 Jan 2025 15:40:54 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/d0576c42ca/?limit=25#f0e2</guid></item><item><title>Srinivas Arava posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/62eabd2eae/?limit=25#cc3b</link><description>I was able to run the tess4j in a windows machine without actually installing the software. It is picking up the required dll's from the jars or path. I am not able to do the same on linux. I tried copying the .so files one by one until I hit blocker. java,lang.UnsatisfiedLinkError: /lib64/libm.so.6: version 'GLIBC_2.29' not found (required by libpng15.so.15) My goal is to be able to run the tesst4j with expliciltly installing tesseract but by simply packaging the so files. Can someone please guide...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Srinivas Arava</dc:creator><pubDate>Tue, 05 Nov 2024 16:16:01 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/62eabd2eae/?limit=25#cc3b</guid></item><item><title>Jian Wang modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/edd3c5b551/?limit=25#4c3f</link><description>I have solved my question.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jian Wang</dc:creator><pubDate>Sat, 20 Jul 2024 15:36:25 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/edd3c5b551/?limit=25#4c3f</guid></item><item><title>Jian Wang modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/edd3c5b551/?limit=25#4c3f</link><description/><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jian Wang</dc:creator><pubDate>Sat, 20 Jul 2024 15:35:00 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/edd3c5b551/?limit=25#4c3f</guid></item><item><title>Jian Wang posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/edd3c5b551/?limit=25#4c3f</link><description>Hello, Is Tess4J an open-source project? Where is the source code please? Thank you.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jian Wang</dc:creator><pubDate>Sat, 20 Jul 2024 15:34:23 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/edd3c5b551/?limit=25#4c3f</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/fac985f12d/?limit=25#8448</link><description>For Tesseract non-Windows binary, you'll have to install or compile it yourself. https://tesseract-ocr.github.io/tessdoc/#compiling-and-installation</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Mon, 26 Jun 2023 22:48:21 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/fac985f12d/?limit=25#8448</guid></item><item><title>Angelo Schneider posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/fac985f12d/?limit=25#c171</link><description>Hello, for Macs the binay of the lib is missing: darwin/libtesseract.dylib Best regards Angelo</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Angelo Schneider</dc:creator><pubDate>Mon, 26 Jun 2023 17:55:27 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/fac985f12d/?limit=25#c171</guid></item><item><title>Anonymous posted a comment on ticket #19</title><link>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#6556</link><description>Thanks for your support. I ended up not using the path returned by the method. I let tess4j do its thing and that works fine. If I ever end up needing the path, I'll ensure that my registry value works or that I do it another way</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Wed, 17 May 2023 03:18:15 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#6556</guid></item><item><title>Quan Nguyen modified a comment on ticket #19</title><link>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#55f4</link><description>On my Win11 machine, java.io.tmpdir is resolved to C:\Users\&lt;username&gt;\AppData\Local\Temp\tess4j. You might have correctly assessed, this seems to be due to a legacy dos setting in windows on your machine. You might try setting the Windows Registry value as suggested in the second article you mentioned.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Tue, 18 Apr 2023 01:22:33 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#55f4</guid></item><item><title>Quan Nguyen posted a comment on ticket #19</title><link>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#55f4</link><description>On my Win11 machine, java.io.tmpdir is resolved to C:\Users\&lt;username&gt;\AppData\Local\Temp\tess4j. You might have correctly assessed, this seems to be due to a legacy dos setting in windows on your machine.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Tue, 18 Apr 2023 01:17:36 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#55f4</guid></item><item><title>Anonymous posted a comment on ticket #19</title><link>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#bee1</link><description>I'm using Win11 Pro (64 Bit).</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Tue, 18 Apr 2023 01:07:07 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#bee1</guid></item><item><title>Quan Nguyen modified ticket #18</title><link>https://sourceforge.net/p/tess4j/bugs/18/</link><description>Tesseract upgrade missing text when extracting</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 06 Apr 2023 23:41:24 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/18/</guid></item><item><title>Quan Nguyen posted a comment on ticket #19</title><link>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#6bfd</link><description>I remember the 8.3 filename limitation in old DOS or Windows 95 era, but all modern OSes should be able to handle the long filenames. Which Windows version are you seeing the issue in?</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 06 Apr 2023 23:38:17 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/19/?limit=25#6bfd</guid></item><item><title>Anonymous created ticket #19</title><link>https://sourceforge.net/p/tess4j/bugs/19/</link><description>LoadLibs.extractTessResources() returns wrong dos  style filenames</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Tue, 21 Mar 2023 08:54:05 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/19/</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/de821a4a/?limit=25#80df/9bb1/6636</link><description>@Praveen Anand Please use the Lept4J version compatible with your Leptonica installation.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Sun, 05 Mar 2023 20:55:15 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/de821a4a/?limit=25#80df/9bb1/6636</guid></item><item><title>Praveen Anand posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/de821a4a/?limit=25#80df/9bb1</link><description>@ShawnChen Did this issue got resolved ? Im facing the exact same error</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Praveen Anand</dc:creator><pubDate>Wed, 01 Mar 2023 06:47:13 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/de821a4a/?limit=25#80df/9bb1</guid></item><item><title>Synergi posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#a1c8</link><description>Fixed it. It was an issue with the JNA dependency. Had JNA loaded in another linked project. As a result it was suing the older version vs this one below. &lt;dependency&gt; &lt;groupId&gt;net.java.dev.jna&lt;/groupId&gt; &lt;artifactId&gt;jna&lt;/artifactId&gt; &lt;version&gt;5.12.1&lt;/version&gt; &lt;/dependency&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Synergi</dc:creator><pubDate>Thu, 19 Jan 2023 00:39:10 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#a1c8</guid></item><item><title>Synergi modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#a018</link><description>So as it appears to me.... LoadLibs wants to copy the contents from a folder named linux-x86-64 in the jar file into /tmp/tess4j/linux-x86-64. The issue I see is the folder linux-x86-64 doesn't appear to exist in the jar file (tess4j-5.5.0.jar). Now as its a Linux system, I am guessing it doesn't need this tmp folder... but regardless of this the code seems to crash. FYI it seems to execute a similar process with Lept4J and copies over a dll from a windows directory in the jar file. I don't think...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Synergi</dc:creator><pubDate>Wed, 18 Jan 2023 15:44:41 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#a018</guid></item><item><title>Synergi modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#a018</link><description>So as it appears to me.... LoadLibs wants to copy the contents from a folder named linux-x86-64 in the jar file into /tmp/tess4j/linux-x86-64. The issue I see is the folder linux-x86-64 doesn't appear to exist in the jar file (tess4j-5.5.0.jar). Now as its a Linux system, I am guessing it doesn't need this tmp folder... but the code seems to crash. FYI it seems to execute a similar process with Lept4J and copies over a dll form a windows directory in the jar file. I dont think its used, but it allows...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Synergi</dc:creator><pubDate>Wed, 18 Jan 2023 15:43:20 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#a018</guid></item><item><title>Synergi posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#a018</link><description>So as it appears to me.... LoadLibs wants to copy the contents from a folder named linux-x86-64 in the jar file into /tmp/tess4j/linux-x86-64. The issue I see is the folder linux-x86-64 doesn't appear to exist in the jar file (tess4j-5.5.0.jar). Now as its a Linux system, I am guessing it doesn't need this tmp folder... but the code seems to crash. FYI it seems to execute a similar process with Lept4J and copies over a dll form a windows directory in the jar file. I dont think its used, but it allows...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Synergi</dc:creator><pubDate>Wed, 18 Jan 2023 15:42:59 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#a018</guid></item><item><title>Synergi modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#ecf9</link><description>I am using tess4j v 5.5.0 (which is supposed to work with Tesseract 5.0.3) via Maven in Java on Linux Ubuntu 20.04.3 LTS (Focal Fossa). The application I am using worked previously using Tess4J with Tesseract 4.1.1. I keep getting errors now when I run the following code :- TessAPI.TessBaseAPI handle = TessAPI.INSTANCE.TessBaseAPICreate(); This always worked in the past but now I get the following error :- Exception in thread "pool-23-thread-1" java.lang.NoClassDefFoundError: Could not initialize...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Synergi</dc:creator><pubDate>Wed, 18 Jan 2023 15:34:09 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#ecf9</guid></item><item><title>Synergi modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#ecf9</link><description>I am using tess4j v 5.5.0 (which is supposed to work with Tesseract 5.0.3) via Maven in Java on Linux Ubuntu 20.04.3 LTS (Focal Fossa). The application I am using worked previously using Tess4J with Tesseract 4.1.1. I keep getting errors now when I run the following code :- TessAPI.TessBaseAPI handle = TessAPI.INSTANCE.TessBaseAPICreate(); This always worked in the past but now I get the following error :- Exception in thread "pool-23-thread-1" java.lang.NoClassDefFoundError: Could not initialize...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Synergi</dc:creator><pubDate>Wed, 18 Jan 2023 15:28:01 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#ecf9</guid></item><item><title>Synergi posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#ecf9</link><description>I am using tess4j v 5.5.0 (which is supposed to work with Tesseract 5.0.3) via Maven in Java on Linux Ubuntu 20.04.3 LTS (Focal Fossa). The application I am using worked previously using Tess4J with Tesseract 4.1.1. I keep getting errors now when I run the following code :- TessAPI.TessBaseAPI handle = TessAPI.INSTANCE.TessBaseAPICreate(); This always worked in the past but now I get the following error :- Exception in thread "pool-23-thread-1" java.lang.NoClassDefFoundError: Could not initialize...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Synergi</dc:creator><pubDate>Wed, 18 Jan 2023 15:25:47 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/c72af4d4b8/?limit=25#ecf9</guid></item><item><title>giuseppe coniglio posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/6ac012cdfa/?limit=25#fc22</link><description>Hi to all, I have implemented a Spring boot microservice which use tess4j 4.3.1 and pdfbox 2.0.22 in my server Oracle Linux Server , example code https://colwil.com/how-to-extract-text-from-a-scanned-pdf-using-ocr-in-java/ When I execute code with my IDE on windows pc and invoke local service, time execution is fast : "Tesseract.doOcr" 8 seconds, so when I execute api to invoke microservice's code method "Tesseract.doOcr" is slow 40-50 seconds, parameter pdf file is the same Any idea? Thanks :-)</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">giuseppe coniglio</dc:creator><pubDate>Mon, 28 Nov 2022 15:44:13 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/6ac012cdfa/?limit=25#fc22</guid></item><item><title>Quan Nguyen posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1e9e</link><description>If it was properly installed after built, a libtesseract.dylib symbolic link would be created. If not, you can manually create it. This link is what JNA is looking to load the native library.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Mon, 28 Nov 2022 03:09:34 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1e9e</guid></item><item><title>Tevž Selčan modified a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac/1050</link><description>Does this apply to Mac M1? I compiled tesseract like here (https://tesseract-ocr.github.io/tessdoc/Compiling.html#macos) and downloaded Tess4J, but I cant find the libtesseract.dylib file in any of them?</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tevž Selčan</dc:creator><pubDate>Tue, 22 Nov 2022 16:10:54 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac/1050</guid></item><item><title>Tevž Selčan posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac/1050</link><description>Does this apply to Mac M1? I once compiled tesseract like here (https://tesseract-ocr.github.io/tessdoc/Compiling.html#macos) and downloaded Tess4J, but I cant find the libtesseract.dylib file in any of them?</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tevž Selčan</dc:creator><pubDate>Tue, 22 Nov 2022 16:10:36 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac/1050</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/bc40350e42/?limit=25#72ef</link><description>You mean separate physical copies of the training data files? I've seen instances of Tesseract running in multithreaded applications using the same set of training data files.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Sun, 20 Nov 2022 01:05:28 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/bc40350e42/?limit=25#72ef</guid></item><item><title>George posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/bc40350e42/?limit=25#bffb</link><description>Is it necessary to have separate copies of tesseract training data when running multiple instances of Tess4j in a separate JVMs.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">George</dc:creator><pubDate>Thu, 17 Nov 2022 18:07:33 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/bc40350e42/?limit=25#bffb</guid></item><item><title>Quan Nguyen modified a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac</link><description>No need to modify the .jar file. Just need to set jna.library.path property to the location of libtesseract.dylib file during launch. https://tess4j.sourceforge.net/tutorial/</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Tue, 18 Oct 2022 22:06:06 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac</guid></item><item><title>Quan Nguyen modified a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac</link><description>No need to modify the .jar file. Just need to set jna.library.path property to the location of libtesseract.dylib file during launch.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Tue, 18 Oct 2022 22:02:50 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac</guid></item><item><title>Quan Nguyen posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac</link><description>No need to modify the .jar file. Just need to set jna.library.path property to the location of libtesseract.dylib file during launch</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Tue, 18 Oct 2022 22:01:09 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#1fac</guid></item><item><title>Tevž Selčan posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#873e</link><description>Issue solved, https://stackoverflow.com/questions/21394537/tess4j-unsatisfied-link-error-on-mac-os-x</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tevž Selčan</dc:creator><pubDate>Mon, 17 Oct 2022 13:13:34 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#873e</guid></item><item><title>Tevž Selčan posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#6114</link><description>Hello, I have a problem while trying to use Tess4J with Maven. I get this error : Exception in thread "main" java.lang.UnsatisfiedLinkError: Can't load library: /Users/tevzselcan/Library/Caches/JNA/temp/jna1926430164363992306.tmp at java.base/java.lang.ClassLoader.loadLibrary(ClassLoader.java:2393) at java.base/java.lang.Runtime.load0(Runtime.java:755) at java.base/java.lang.System.load(System.java:1953) at com.sun.jna.Native.loadNativeDispatchLibraryFromClasspath(Native.java:1018) at com.sun.jna.Native.loadNativeDispatchLibrary(Native.java:988)...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tevž Selčan</dc:creator><pubDate>Mon, 17 Oct 2022 13:02:53 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/4ec2dbbe87/?limit=25#6114</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/1eb7fad937/?limit=25#7ff8</link><description>The bug was fixed in tess4j-5.4.0 release.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 29 Sep 2022 03:20:33 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/1eb7fad937/?limit=25#7ff8</guid></item><item><title>Quan Nguyen modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/e1c622c8c7/?limit=25#3fa5</link><description>The latest source is being hosted at https://github.com/nguyenq/tess4j .</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 29 Sep 2022 03:17:25 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/e1c622c8c7/?limit=25#3fa5</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/e1c622c8c7/?limit=25#3fa5</link><description>The latest source is being hosted at https://github.com/nguyenq/tess4j.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 29 Sep 2022 03:17:05 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/e1c622c8c7/?limit=25#3fa5</guid></item><item><title>Ralph Cook posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/e1c622c8c7/?limit=25#6d9f</link><description>I want to put the source for Tess4J into eclipse so I can debug a problem I'm having. The current version of the library appears to be 5.4.0; if I put a dependency for net.sourceforge.tess4j:tess4j:5.4.0 in a Maven pom.xml file and update the project, I get a tess4j-5.4.0.jar. I cannot find source labelled for that version -- the latest I can find after rooting around on tess4j.sourceforge.net is labelled 3.4.8; the sources themselves do not have version numbers in them, so I cannot tell whether...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Ralph Cook</dc:creator><pubDate>Sat, 24 Sep 2022 12:21:26 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/e1c622c8c7/?limit=25#6d9f</guid></item><item><title>L Evans posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/1eb7fad937/?limit=25#d019</link><description>The output OCR documents look good. So, the 1 word count is really misleading. We have conditional logic that follows the createDocumentsWithResults() call that relies on the size of the Words list in the OCRResult.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">L Evans</dc:creator><pubDate>Tue, 19 Jul 2022 16:13:42 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/1eb7fad937/?limit=25#d019</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/1eb7fad937/?limit=25#3c8e</link><description>What about the output documents (files) themselves? Can you put in a new issue at https://github.com/nguyenq/tess4j/issues ? Thanks.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Mon, 18 Jul 2022 23:06:09 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/1eb7fad937/?limit=25#3c8e</guid></item><item><title>L Evans posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/1eb7fad937/?limit=25#5b81</link><description>We've encountered a bug when calling createDocumentsWithResults() from Tesseract/tess4j 4.5.5. The Tiff scanned by the method call, has 32 pages, and ~3100 words. Yet, the result produced by the Java call only contains the result of the last page scanned. The OCRResult, in Java, is an empty string in this bounding box: [ [Confidence: 95.000000 Bounding box: 313 434 938 822]], which is the same result when scanning the last page of the Tiff file. Can the Tess4j team investigate this bug ?</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">L Evans</dc:creator><pubDate>Mon, 18 Jul 2022 19:17:51 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/1eb7fad937/?limit=25#5b81</guid></item><item><title>Xunnozza Vlinx Xenx posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/cc813ff403/?limit=25#5cc2</link><description>Hello, Can please reduce the unnecessary dependence jai-imageio-core:1.4.0 ? The last update was also over 4 years ago. Also, I see (sorry if I missed something) that this library is only used for TIFF Meta and this is also possible with the Java 11 api. Therefore I recommend to remove this dependency and use the new Java API.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Xunnozza Vlinx Xenx</dc:creator><pubDate>Mon, 20 Jun 2022 12:33:43 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/cc813ff403/?limit=25#5cc2</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/65353c3043/?limit=25#64b1</link><description>https://github.com/nguyenq/tess4j/issues/230</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Sun, 08 May 2022 00:56:32 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/65353c3043/?limit=25#64b1</guid></item><item><title>Peter Kronenberg posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/65353c3043/?limit=25#6c97</link><description>I'm seeing an error in the ImageDeskew routine. The below sample code shows a rotation of -6.8 (the unredacted version shows -10) on the attached file even though it should be 0. Any idea why it’s not calculating correctly? It seems to happen on somewhat sparse images like this, which understably makes it harder to figure out the orientation. I'm wondering if anything can be done to make it more accurate public class GetAngle { private static double getAngle(Path sourceFile) throws IOException {...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Peter Kronenberg</dc:creator><pubDate>Sun, 03 Apr 2022 20:12:55 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/65353c3043/?limit=25#6c97</guid></item><item><title>Moritz Weibold posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3f303061d5/?limit=25#27a6</link><description>Hey there, I am using Tess4J to extract the sum of a bill. My Maven Quarkus Server works perfektly fine on localhost in IntelliJ. After running the following command, I always pushed the target/quarkus-app/ folder onto my oracle vm. mvn clean build And as soon as the folder is uploaded, I run: java -jar server/quarkus-run.jar &amp; The issue is, that on my oracle vm the server suddenly stop at the tesseract.doOCR(tempFile) function. There is no error or any hint on why it is not working. The server also...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Moritz Weibold</dc:creator><pubDate>Tue, 29 Mar 2022 09:57:47 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3f303061d5/?limit=25#27a6</guid></item><item><title>Quan Nguyen posted a comment on ticket #18</title><link>https://sourceforge.net/p/tess4j/bugs/18/?limit=25#8576</link><description>You may want to put in a ticket at https://github.com/tesseract-ocr/tesseract/issues site. Thanks.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Wed, 23 Mar 2022 11:30:53 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/18/?limit=25#8576</guid></item><item><title>Anonymous posted a comment on ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#871b/d460/d182</link><description>Thanks a lot</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Wed, 23 Mar 2022 11:22:53 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#871b/d460/d182</guid></item><item><title>Anonymous created ticket #18</title><link>https://sourceforge.net/p/tess4j/bugs/18/</link><description>Tesseract upgrade missing text when extracting</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Thu, 17 Mar 2022 11:46:58 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/18/</guid></item><item><title>Quan Nguyen posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/309b0bc8b3/?limit=25#103c</link><description>JNA is looking for a libtesseract.dylib to load. Do you have it in system path? Several developers were able to use the library on MacOs. Please search through the forum posts.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Mon, 14 Mar 2022 02:25:58 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/309b0bc8b3/?limit=25#103c</guid></item><item><title>Ben posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/309b0bc8b3/?limit=25#b5a4</link><description>Hi, I have tried to get it to work so many times but it still is not working. I added the dependency to my maven and then wrote the code following instructions. I'm not sure why it is not working. Could someone help? Thanks!</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Ben</dc:creator><pubDate>Tue, 08 Mar 2022 06:39:20 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/309b0bc8b3/?limit=25#b5a4</guid></item><item><title>Quan Nguyen posted a comment on ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#871b/d460</link><description>Yes, tess4j-4.6.1.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Tue, 22 Feb 2022 23:58:05 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#871b/d460</guid></item><item><title>Anonymous posted a comment on ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#871b</link><description>Do we have this fixed for Tess4J that will work with Tesseract 4.1.1?</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Tue, 22 Feb 2022 23:53:55 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#871b</guid></item><item><title>Anantha posted a comment on ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#8ae6</link><description>Thank you for the fix!</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anantha</dc:creator><pubDate>Thu, 27 Jan 2022 02:55:24 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#8ae6</guid></item><item><title>Quan Nguyen modified ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/</link><description>Security - log4j2 vulnerability - Tess4J  using old version(1.2.17) of log4j which needs upgrade to 2.17.1</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 27 Jan 2022 01:53:55 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/</guid></item><item><title>Quan Nguyen modified a comment on ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#5a9f</link><description>5.1.1 has been released with ghost4j dependency removed. Thank you for bringing this issue to our attention.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 27 Jan 2022 01:52:34 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#5a9f</guid></item><item><title>Quan Nguyen posted a comment on ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#5a9f</link><description>5.1.1 has been released with ghost4j dependency removed.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 27 Jan 2022 01:52:01 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#5a9f</guid></item><item><title>Quan Nguyen posted a comment on ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#f342</link><description>If vulnerabilities exist in ghost4j library, that's beyond our control. We can elect to remove ghost4j dependency from tess4j.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Wed, 26 Jan 2022 12:31:45 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#f342</guid></item><item><title>Anantha posted a comment on ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#1a1f</link><description>Upon upgrading tess4j to latest version(5.1.0) , we could still see log4j 1.2.17 dependency coming from ghost4j, could you please check Attached is the screenshot for reference</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anantha</dc:creator><pubDate>Tue, 25 Jan 2022 15:46:47 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#1a1f</guid></item><item><title>Quan Nguyen posted a comment on ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#e2e2</link><description>According to Apache Log4j Security Vulnerabilities, Log4j 1.x is not impacted by this vulnerability. Latest versions of tess4j do not have log4j dependency.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Tue, 04 Jan 2022 23:19:51 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/?limit=25#e2e2</guid></item><item><title>Quan Nguyen posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#cb9c</link><description>I suggest that you clone the github repository, switch to tess4j-3 branch, study and execute the unit tests in your IDE, and go from there. You may want to start out with the simple example first to ensure that the library and its dependencies are set up correctly before going further with more complicated codes.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Tue, 04 Jan 2022 23:15:27 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#cb9c</guid></item><item><title>Anantha created ticket #17</title><link>https://sourceforge.net/p/tess4j/bugs/17/</link><description>Security - log4j2 vulnerability - Tess4J  using old version(1.2.17) of log4j which needs upgrade to 2.17.1</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anantha</dc:creator><pubDate>Tue, 04 Jan 2022 17:45:44 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/17/</guid></item><item><title>Kehinde Adeoya posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#a32e</link><description>Thanks. I have switched to tesseract 3.0.5 but I'm still getting the same error. Could you help figure this out by scheduling a Zoom call? Please let me know when it's convenient by you. I am using tesseract 3.0.5.2 and Tess4j- 3.5.0, lept4j-1.13.0, and jna-5.10.0 This is the error I got this morning # # A fatal error has been detected by the Java Runtime Environment: # # SIGSEGV (0xb) at pc=0x000000012f453b6f, pid=33004, tid=9987 # # JRE version: OpenJDK Runtime Environment Homebrew (11.0.12) (build...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Kehinde Adeoya</dc:creator><pubDate>Mon, 03 Jan 2022 04:33:41 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#a32e</guid></item><item><title>Quan Nguyen posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#4c12</link><description>Your dependency versions look correct. https://mvnrepository.com/artifact/net.sourceforge.tess4j/tess4j/5.0.0 If the simple example works right, that means jna/tess4j/lept4j are working properly with your tesseract/leptonica installation. That suggests something is not working correctly in your application code. Look at the test cases in tess4j project for examples: https://github.com/nguyenq/tess4j As mentioned in Issue 1074, the font info was only available in tesseract 3.x.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Fri, 31 Dec 2021 16:56:26 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#4c12</guid></item><item><title>Kehinde Adeoya modified a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#11ad</link><description>Thanks for your support. The simple app works fine. The font info is what I want to obtain at the moment. The reason for this hassles. I am presently using this combination of libraries. i reasoned with you after reading the github link on this issue, but I think it's been 5 years when that was published, any current update by tesseract on it? tesseract 5.0.0-29-g727796 leptonica-1.82.0 libgif 5.2.1 : libjpeg 9d : libpng 1.6.37 : libtiff 4.3.0 : zlib 1.2.11 : libwebp 1.2.1 : libopenjp2 2.4.0 Found...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Kehinde Adeoya</dc:creator><pubDate>Fri, 31 Dec 2021 10:47:25 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#11ad</guid></item><item><title>Kehinde Adeoya modified a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#11ad</link><description>Thanks for your support. The simple app works fine. The font info is what I want to obtain at the moment. The reason for this hassles. I am presently using this combination of libraries. i reasoned with you after reading the github link on this issue, but I think it's been 5 years when that was published, any current update by tesseract on it? tesseract 5.0.0-29-g727796 leptonica-1.82.0 libgif 5.2.1 : libjpeg 9d : libpng 1.6.37 : libtiff 4.3.0 : zlib 1.2.11 : libwebp 1.2.1 : libopenjp2 2.4.0 Found...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Kehinde Adeoya</dc:creator><pubDate>Fri, 31 Dec 2021 10:02:33 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#11ad</guid></item><item><title>Kehinde Adeoya posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#11ad</link><description>Thanks for your support. The simple app works fine. The font info is what I want to obtain at the moment. The reason for this hassles. I am presently using this combination of libraries. i reasoned with you after reading the github link on this issue, but I think it's been 5 years when that was published, any current update by tesseract on it? tesseract 5.0.0-29-g727796 leptonica-1.82.0 libgif 5.2.1 : libjpeg 9d : libpng 1.6.37 : libtiff 4.3.0 : zlib 1.2.11 : libwebp 1.2.1 : libopenjp2 2.4.0 Found...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Kehinde Adeoya</dc:creator><pubDate>Fri, 31 Dec 2021 10:01:56 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#11ad</guid></item><item><title>Quan Nguyen posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#a9b6</link><description>What's the output of executing tesseract -v in the terminal? Make sure you use the Java library versions that match your native ones. I suggest you try a simple example first. http://tess4j.sourceforge.net/codesample.html If you want to obtain font info, I don't think the feature is not available in tesseract 4 and 5. https://github.com/tesseract-ocr/tesseract/issues/1074</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 30 Dec 2021 15:06:39 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#a9b6</guid></item><item><title>Quan Nguyen posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/03c94e2b4f/?limit=25#3afd</link><description>No. The program will convert the input PDF to a multi-page TIFF image. What you can do is process the PDF before the OCR step, probably use PDFBox to extract a specified page, then convert that page to an image, and send it to tesseract engine.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Thu, 30 Dec 2021 15:00:31 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/03c94e2b4f/?limit=25#3afd</guid></item><item><title>Kehinde Adeoya posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#4971</link><description>This is the error I'm getting # # A fatal error has been detected by the Java Runtime Environment: # # SIGSEGV (0xb) at pc=0x000000010c005e9d, pid=66743, tid=9475 # # JRE version: Java(TM) SE Runtime Environment (17.0.1+12) (build 17.0.1+12-LTS-39) # Java VM: Java HotSpot(TM) 64-Bit Server VM (17.0.1+12-LTS-39, mixed mode, sharing, tiered, compressed oops, compressed class ptrs, g1 gc, bsd-amd64) # Problematic frame: # C [libtesseract.dylib+0x5e9d] tesseract::TessBaseAPI::Init(char const*, int, char...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Kehinde Adeoya</dc:creator><pubDate>Thu, 30 Dec 2021 11:40:27 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#4971</guid></item><item><title>Kehinde Adeoya posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#8407</link><description>@nguyenq can i have an answer to this, please, I'm fagged out trying to resolve a single problem for over a week.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Kehinde Adeoya</dc:creator><pubDate>Thu, 30 Dec 2021 11:36:16 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#8407</guid></item><item><title>Kehinde Adeoya modified a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#0eac</link><description>I have earlier posted in the wrong forum. I have tried to repost in the HELP forum but it seems there's no way to edit and switch forums once it has been submitted. This is a cry for help. I am fagged out trying to resolve this problem for over a week. It's simple installing and setup of Tesseract and Tess4J on MacOS Monterey. I have followed all docs available but none could resolve the issue. I hope I can find the right help here. I am trying to get the text/font/style properties of an image. i...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Kehinde Adeoya</dc:creator><pubDate>Wed, 29 Dec 2021 14:41:54 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#0eac</guid></item><item><title>Kehinde Adeoya posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#0eac</link><description>I have earlier posted in the wrong forum. I have tried to repost in the HELP forum but it seems there's no way to edit and switch forums once it has been submitted. This is a cry for help. I am fagged out trying to resolve this problem for over a week. It's simple installing and setup of Tesseract and Tess4J on MacOS Monterey. I have followed all docs available but none could resolve the issue. I hope I can find the right help here. I am trying to get the text/font/style properties of an image. i...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Kehinde Adeoya</dc:creator><pubDate>Wed, 29 Dec 2021 14:40:50 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e41474502/?limit=25#0eac</guid></item><item><title>Alfonso Vizcaino modified a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/03c94e2b4f/?limit=25#fd19</link><description>Hello When using PDF files with multiple pages, is there a way to specify which page i want to do OCR? Thanks</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Alfonso Vizcaino</dc:creator><pubDate>Mon, 27 Dec 2021 20:32:33 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/03c94e2b4f/?limit=25#fd19</guid></item><item><title>Alfonso Vizcaino posted a comment on discussion Open Discussion</title><link>https://sourceforge.net/p/tess4j/discussion/1202293/thread/03c94e2b4f/?limit=25#fd19</link><description>Hello When using PDF files with multiple pages, is there a way to specify which page i want to do OCR? Thanks</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Alfonso Vizcaino</dc:creator><pubDate>Mon, 27 Dec 2021 20:32:12 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202293/thread/03c94e2b4f/?limit=25#fd19</guid></item><item><title>Quan Nguyen posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e8f8d15d0/?limit=25#6301</link><description>In Tess4J, PDF documents are converted to grayscale images by Ghostscript or PDFBox before feeding to Tesseract OCR engine. You can do your own conversion of PDF files before the OCR processing.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Fri, 15 Oct 2021 02:35:57 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e8f8d15d0/?limit=25#6301</guid></item><item><title>John Mc.Queide Clemente modified a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e8f8d15d0/?limit=25#8828</link><description>I am doing a OCR in a PDF file, but the PDF result file loses its color. Am I doing something wrong? That doesn't happen when my input file is a PNG file. This is my code snippet public class OcrServiceImpl implements OcrService { @Override public void doOcr(String inputPath, String outputPath) { try { List&lt;ITesseract.RenderedFormat&gt; renderList = new ArrayList&lt;&gt;(); renderList.add(ITesseract.RenderedFormat.PDF); Tesseract tesseract = new Tesseract(); tesseract.setOcrEngineMode(0); tesseract.setDatapath("C:\\Program...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">John Mc.Queide Clemente</dc:creator><pubDate>Thu, 14 Oct 2021 17:20:44 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e8f8d15d0/?limit=25#8828</guid></item><item><title>John Mc.Queide Clemente posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e8f8d15d0/?limit=25#8828</link><description>I am doing a OCR in a PDF file, but the PDF result file loses its color. Am I doing something wrong? That doesn't happen when my input file is a PNG file. This is my code snippet public class OcrServiceImpl implements OcrService { @Override public void doOcr(String inputPath, String outputPath) { try { List&lt;ITesseract.RenderedFormat&gt; renderList = new ArrayList&lt;&gt;(); renderList.add(ITesseract.RenderedFormat.PDF); Tesseract tesseract = new Tesseract(); tesseract.setOcrEngineMode(0); tesseract.setDatapath("C:\\Program...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">John Mc.Queide Clemente</dc:creator><pubDate>Thu, 14 Oct 2021 16:27:49 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/3e8f8d15d0/?limit=25#8828</guid></item><item><title>Quan Nguyen modified a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0980936ff0/?limit=25#62a0</link><description>MS Document formats are not supported. The library can only produce the output formats that Tesseract supports.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Sat, 21 Aug 2021 04:28:31 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0980936ff0/?limit=25#62a0</guid></item><item><title>Quan Nguyen posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0980936ff0/?limit=25#62a0</link><description>MS Word format is not supported. The library can only produce the output formats that Tesseract supports.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Sat, 21 Aug 2021 04:27:58 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0980936ff0/?limit=25#62a0</guid></item><item><title>Quan Nguyen posted a comment on ticket #4</title><link>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#c8d6</link><description>Please continue the discussion either in the Discussion section or over on GitHub site rather than on this old, closed ticket. Thanks.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Wed, 05 May 2021 01:32:01 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#c8d6</guid></item><item><title>Peter Kronenberg modified a comment on ticket #4</title><link>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#8895</link><description>I see TessBaseAPIAllWordConfidences, which says that it returns the same number of values as that returned by GetUTF8. But TessBaseAPIGetUTF8Text returns a single string, not an array. Can you provide an example? I've read the Javadoc, but it's not always clear without an example. Is there an efficient way to process multiple images, but one at a time, without sending them all in as an array. TessBaseAPIAllWordConfidences() doesn't seem to work with doOCR(), because doOCR() closes everything down...</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Peter Kronenberg</dc:creator><pubDate>Tue, 04 May 2021 01:43:45 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#8895</guid></item><item><title>Peter Kronenberg posted a comment on ticket #4</title><link>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#8895</link><description>I see TessBaseAPIAllWordConfidences, which says that it returns the same number of values as that returned by GetUTF8. But TessBaseAPIGetUTF8Text returns a single string, not an array. Can you provide an example? I've read the Javadoc, but it's not always clear without an example. Is there an efficient way to process multiple images, but one at a time, without sending them all in as an array</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Peter Kronenberg</dc:creator><pubDate>Tue, 04 May 2021 01:41:56 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#8895</guid></item><item><title>Quan Nguyen posted a comment on ticket #4</title><link>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#88a3</link><description>Documentation: http://tess4j.sourceforge.net/docs/docs-4.4/ You can pass in a List&lt;IIOImage&gt; to doOCR method. There are other methods in Tesseract class that returns confidence values. JNA Direct Mapping: https://github.com/java-native-access/jna/blob/master/www/DirectMapping.md</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Mon, 03 May 2021 23:54:05 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#88a3</guid></item><item><title>Anonymous posted a comment on ticket #4</title><link>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#8085</link><description>I know this issue is a years old, but I'm wondering what is the current 'best' way to get the confidences? Like others, I am also confused by the difference between Tesseract vs Tesseract1 and TessAPI vs TessAPI1 I see what you said about doOcr() being intended for a single image because it shuts down after processing. What is the best way to be able to process multiple images? Is there any documentation on the best way to do this (as well as getting the confidences) thank you</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Anonymous</dc:creator><pubDate>Mon, 03 May 2021 23:46:02 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#8085</guid></item><item><title>Peter Kronenberg posted a comment on ticket #4</title><link>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#0d69</link><description>I just entered that last post, but I wasn't logged in.</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Peter Kronenberg</dc:creator><pubDate>Mon, 03 May 2021 15:26:49 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/bugs/4/?limit=25#0d69</guid></item><item><title>sriKrishnaKumar posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0980936ff0/?limit=25#6527</link><description>Hello Team, I am looking to develop an application internally to do convert Image format to Searchable PDF and then Searchable PDF to Microsoft document format or directly from Image Format to Microsft Document Format. Does Tess4J along with other library supports this requirement. I know we can use Tess4j to convert image to Searchable PDF. Any suggestions are welcome</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">sriKrishnaKumar</dc:creator><pubDate>Thu, 15 Apr 2021 20:54:15 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0980936ff0/?limit=25#6527</guid></item><item><title>Quan Nguyen posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0cb54e6c42/?limit=25#4025</link><description>The Leptonica API method seems to have changed over the years after several versions. http://tess4j.sourceforge.net/docs/lept4j-docs-1.10.0/net/sourceforge/lept4j/Leptonica1.html http://tess4j.sourceforge.net/docs/lept4j-docs-1.14.0/net/sourceforge/lept4j/Leptonica1.html#pixaaDisplayByPixa(net.sourceforge.lept4j.Pixaa,int,float,int,int,int)</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quan Nguyen</dc:creator><pubDate>Wed, 14 Apr 2021 18:04:13 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0cb54e6c42/?limit=25#4025</guid></item><item><title>Jeremy Young posted a comment on discussion Help</title><link>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0cb54e6c42/?limit=25#f4f9</link><description>IntelliJ is telling me that the parameters for pixaaDisplayByPixa are different from the documentation. Have I done something wrong? If not, is there a workaround? Thx</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jeremy Young</dc:creator><pubDate>Wed, 14 Apr 2021 14:40:08 -0000</pubDate><guid>https://sourceforge.net/p/tess4j/discussion/1202294/thread/0cb54e6c42/?limit=25#f4f9</guid></item></channel></rss>