Type: nodetool.text.HtmlToText
Namespace: nodetool.text
Description
Converts HTML content to plain text using html2text. html, convert, text, parse, extract
Use cases:
- Converting HTML documents to readable plain text
- Extracting text content from web pages
- Cleaning HTML markup from text data
- Processing HTML emails or documents
Properties
| Property | Type | Description | Default |
|---|---|---|---|
| html | str |
HTML content to convert | `` |
| base_url | str |
Base URL for resolving relative links | `` |
| body_width | int |
Width for text wrapping | 1000 |
| ignore_images | bool |
Whether to ignore image tags | True |
| ignore_mailto_links | bool |
Whether to ignore mailto links | True |
Outputs
| Output | Type | Description |
|---|---|---|
| output | str |
Metadata
Related Nodes
Browse other nodes in the nodetool.text namespace.