Html2Text: Tricks to convert a web page into a plain text document

Html2Text is a free application that converts all the content of a web page into a simple plain text document.

This can be very useful when we need to move the information published on a specific web page into a Word document. There are, however, a few tricks to using Html2Text correctly; without them, a whole series of strange characters will appear in what should be nothing more than a simple conversion.

Why not copy and paste instead of using Html2Text?

Someone might think that the easiest way to extract the content of a web page is simply to "copy and paste" it. Although this can give good results, it can also carry over a large number of characters that belong to the HTML encoding of the page. We recommend using Html2Text so that you end up with completely clean text, free of this type of character. To do so, follow these steps:

  • Open the web page and go to the article whose content you want to extract.
  • Copy the full URL of that article.
  • Right-click anywhere on the article content that you have open in your browser.
  • From the context menu, choose the option «Save as».
  • Choose a location on the hard drive and type the file name you want.
  • Open Html2Text and import the file you saved earlier.
  • Click the button to start the conversion.
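To make it clearer what a conversion like this involves, here is a minimal sketch in Python. It is not how Html2Text works internally (the article does not describe that); it only illustrates the general idea of dropping the markup and keeping the readable text, using the standard library's `html.parser`. The names `TextExtractor`, `html_to_text` and `convert_file` are hypothetical and chosen for this example, and the saved page is assumed to be UTF-8 unless another encoding is passed in.

```python
from html.parser import HTMLParser
from pathlib import Path


class TextExtractor(HTMLParser):
    """Collects readable text and ignores the contents of <script> and <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        # convert_charrefs=True decodes entities such as &eacute; into characters
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def html_to_text(html: str) -> str:
    """Return the text fragments of an HTML string, one per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.parts)


def convert_file(path: str, encoding: str = "utf-8") -> Path:
    """Write a .txt file next to the saved page, with the same base name."""
    source = Path(path)
    target = source.with_suffix(".txt")
    text = html_to_text(source.read_text(encoding=encoding))
    target.write_text(text, encoding="utf-8")
    return target
```

Calling `convert_file("article.html")` on a page saved from the browser would produce `article.txt` in the same folder. Because this sketch puts every text fragment on its own line, inline tags such as bold or links split a sentence across lines; a real converter would handle that more carefully.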

That's all we need to do with Html2Text. In a matter of seconds we will have a file with the same name but in TXT format, containing all the information without any strange characters. Keep in mind that when saving the web page you must choose the format option that says "full page"; otherwise, accented letters and other special characters will come out garbled.
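The article does not say why accented words come out wrong, but a common cause of this symptom in general is text being read with a different character encoding than the one it was saved in. The short Python sketch below only illustrates that effect, under that assumption; `decode_page` is a hypothetical helper and is unrelated to Html2Text itself.

```python
def decode_page(raw: bytes, encoding: str) -> str:
    """Decode the bytes of a saved page using the given encoding."""
    return raw.decode(encoding, errors="replace")


# A word with an accent, stored as UTF-8 bytes
raw = "canción".encode("utf-8")

print(decode_page(raw, "utf-8"))    # canción  (matching encoding)
print(decode_page(raw, "latin-1"))  # canciÃ³n (mismatched encoding)
```

The second line shows the kind of "strange characters" that appear when the encodings do not match: the two bytes that represent "ó" in UTF-8 are read as two separate Latin-1 characters.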


JOB said:

    Very good, yes sir. You've saved me a lot of "googlystic" search headaches. It is just what it promises and what I was looking for with the keywords I entered. Thank you very much.