Activity: Talk or presentation › Science to science
Description
Text documents from the Internet and the local file system that apply XML and HTML markup can be processed with parsers. JSoup is a powerful Java implementation of a parser and transformer allowing to fetch marked up text directly from the Internet, process and convert it from e.g. HTML to xhtml, or to analyze the DOM elements in the parse tree in an easy manner. Using the ooRexx-Java bridge BSF4ooRexx (an external function and class library for ooRexx) it is very easy to take advantage of JSoup from ooRexx programs without a need to know Java at all. As Java class libraries can be run on all operating systems out of the box, such ooRexx programs will run unchanged on all operating systems like Windows, macOS, Linux, AIX and much more.