Open Source Java Text Processing Software - Page 3

Java Text Processing Software

Browse free open source Java Text Processing Software and projects below.

  • 1
    A simple application that transforms XML into a CSV-like file.
    Downloads: 2 This Week
  • 2
    A universal suite of utilities for large corpora processing.
    Downloads: 2 This Week
  • 3
    Transliterator between files in any language: map fonts, create encoding schemes, and use phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish, or any other world-language keyboard input. Example: [Phonetic Input]-[Any World Language Output] or vice versa.
    Downloads: 2 This Week
  • 4
    A free, handy text editor for both plain text and formatted text files, with printing support. Since release 1.2.0 it includes a tool for conversion to PDF. It is written in Java, so it runs on many operating systems.
    Downloads: 2 This Week
  • 5
    Downloads: 2 This Week
  • 6
    Emerald Text Editor (jEditor)

    Emerald Text Editor is a highly customizable tabbed text editor

    Emerald Text Editor (Emerald Editor, or Emerald as I call it), formerly called jEditor, is a text editor similar to Notepad in that it lets you edit text, but it uses tabbed panes, so you can have multiple tabs open and edit several files at once. Emerald Text Editor also comes with a toolbar that shows how quickly you are typing and how many characters are in your current document. The program is also customizable, meaning you can change some of its main features. The name was changed to fit a future naming scheme I plan to use.
    Downloads: 1 This Week
  • 7
    Simple Java parser for delimited and fixed-width files. Handles CSV, Excel CSV, tab, and pipe delimiters, to name a few. Maps column positions in the file to user-friendly names via XML. See "FlatPack Feature List" under News for the complete feature list.
    Downloads: 1 This Week
  • 8
    A basic HTML editor that I have been developing, currently in the first stages of development. I hope to make this a very easy-to-use text/HTML editor, and I have begun adding insert functions for common HTML tags.
    Downloads: 1 This Week
  • 9
    Provides a robust and efficient implementation of n-gram-based classifiers for Java. N-gram algorithms have proven surprisingly good at tasks like guessing the language or encoding of an arbitrary text file, and there are many more applications.
    Downloads: 1 This Week
  • 10
    The NRtfTree library is a set of classes written entirely in C# that can be used to manage (read and write) RTF documents in your own applications. A Java port of the library can be found at http://www.sgoliver.net/blog/?page_id=92
    Downloads: 1 This Week
  • 11
    Piccolo is the fastest SAX parser for Java, supporting SAX1, SAX2, and JAXP (SAX only). Piccolo is different from other parsers in that it was developed using parser generators. It weighs 160K including XML APIs. See http://piccolo.sf.net for more info.
    Downloads: 1 This Week
  • 12
    TextMarker
    TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
    Downloads: 1 This Week
  • 13

    majix

    RTF to XML Converter

    With MajiX you can automatically transform RTF files (Microsoft Word files) into XML. MajiX is Java compliant. You can convert headings, lists (numbered or not), tables, bold, italics, underline, and more.
    Downloads: 1 This Week
  • 14
    Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display how difficult it is to comprehend, using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.
    Downloads: 2 This Week
  • 15

    Change File Encoding

    Change encoding of text files.

    Change File Encoding is a utility that allows you to change the encoding of text files. For example, files saved in US-ASCII can be converted to UTF-8. Over 170 encodings are supported. Requires Java 1.8 or higher.
    Downloads: 1 This Week
  • 16
    HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.
    Downloads: 1 This Week
  • 17
    This project provides Japanese supplements and patches for Apache FOP.
    Downloads: 1 This Week
  • 18
    Likhon is a context-sensitive input method (transliterator) for natural languages. It is designed to analyze the pattern and context of input character sequences and generate output characters based on a predefined map script.
    Downloads: 1 This Week
  • 19

    Musaheb

    An Arabic collocation extraction tool

    “Musaheb” is an Arabic collocation extraction tool designed and implemented to overcome the limitations of existing collocation extraction tools. It can extract n-gram collocations up to 5-grams, as well as the collocates of the nodes (the word types whose collocates are sought) within a window size of zero to 15 words. Moreover, it provides eight collocation statistics to calculate the strength of a collocation, and permits the input of various constraints during node selection and collocate extraction. Based on the user's preferences for node, concordance, and collocate selection, the tool saves all nodes and their associated collocates in an XML file, allowing easy conversion to other formats.
    Downloads: 1 This Week
  • 20
    Script Echo Color

    Colorizes terminal text and simplifies script coding.

    ScriptEchoColor simplifies Linux terminal text colorizing, formatting and several steps of script coding.
    Downloads: 1 This Week
  • 21
    A collection of text utilities for the shell.
    Downloads: 1 This Week
  • 22
    XVCL is a general-purpose language for configuring variants in all sorts of textual documents (including programs). It is based on frame technology. The XVCL processor automates the customization process to produce a system from a specification of variants.
    Downloads: 1 This Week
  • 23
    An Ant task for producing documentation with (PDF)LaTeX using BibTeX, Makeindex and GlossTeX.
    Downloads: 1 This Week
  • 24
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techniques. A command-line executable is shipped that allows sorting documents by codepage.
    Downloads: 1 This Week
  • 25
    This is a text editor with the option to save content as a PDF document. It can also read existing .rtf documents and render them in the editor. These can then be saved as PDF, thereby providing a converter from RTF to PDF format.
    Downloads: 1 This Week