Emsa HTML Tag Remover
Emsa HTML Tag Remover is a software utility that
removes HTML tags from an HTML file, with some extra
control over what is removed and over whitespace
removal. It provides several options for removing
different types of data from the HTML page, and it can
strip whitespace to make the resulting text output as
condensed as necessary. Finally, it works in both
interactive mode and command line mode, which can be
useful for users who want to call this functionality
from other programs or batch files.
When the program is run for the first time, all
options are checked by default. This is the most
thorough removal, and the result will be a single
line of text with no carriage returns. We recommend
taking the time to experiment with the options in
order to see the effect of each particular setting,
until the output is what you want.
To remove the HTML, select an input HTML file, select
or type an output file (usually a text file), and then
click the 'remove html' button. Please also see the
command line mode below.
Some advanced functions:
- Remove all tags strips all remaining HTML data
enclosed between < and > characters.
- Generate foreign & special characters. This function
renders foreign and special character sequences as
their corresponding characters. For example, the
'&pound;' sequence will be rendered as the '£' pound
character; the '&gt;' sequence will be rendered as
the '>' character; and so on.
- Remove spaces will trim all unnecessary spaces from
the output, so there will never be more than one
space between two words.
- Remove blank lines will cut out all unneeded carriage
returns and line feeds, so that all lines containing
text follow one another with no blank lines in
between.
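
The four functions above can be approximated in a few lines of
Python. This is only an illustrative sketch of the general
technique, not the program's actual implementation: the function
names are hypothetical, the tag pattern is a naive regular
expression, and the order of the steps is an assumption. Tags are
removed before character sequences are decoded, so that a decoded
'>' is not mistaken for part of a tag.

```python
import html
import re

# Hypothetical helper names for illustration only; they are not
# taken from Emsa HTML Tag Remover.
TAG_RE = re.compile(r"<[^>]*>")


def remove_tags(text):
    # Drop everything enclosed between '<' and '>'.
    return TAG_RE.sub("", text)


def decode_entities(text):
    # Turn sequences such as '&pound;' or '&gt;' into characters.
    return html.unescape(text)


def remove_spaces(text):
    # Collapse runs of spaces or tabs to one space, trim each line.
    lines = text.splitlines()
    return "\n".join(re.sub(r"[ \t]+", " ", ln).strip() for ln in lines)


def remove_blank_lines(text):
    # Keep only the lines that still contain some text.
    return "\n".join(ln for ln in text.splitlines() if ln.strip())


def strip_html(text):
    # Assumed order: tags, then entities, then whitespace cleanup.
    text = remove_tags(text)
    text = decode_entities(text)
    text = remove_spaces(text)
    return remove_blank_lines(text)


sample = "<p>Price:   &pound;5 &gt; &pound;3</p>\n\n<br>\n<b>Done</b>"
print(strip_html(sample))
```

Each step is a separate function, so any one of them can be left
out, much like unchecking the corresponding option in the program.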
This program is Freeware.