Skip to content
Snippets Groups Projects
clean_html.Rd 631 B
Newer Older
Lucas Schons's avatar
Lucas Schons committed
% Generated by roxygen2: do not edit by hand
Lucas Schons's avatar
Lucas Schons committed
% Please edit documentation in R/clean_html.R
\name{clean_html}
\alias{clean_html}
\title{Clean a html formatted wikipedia page.
Lucas Schons's avatar
Lucas Schons committed
Nodes of interest from the DOM are extracted and then cleaned from all html
tags and annotations.}
\usage{
Lucas Schons's avatar
Lucas Schons committed
clean_html(html)
Lucas Schons's avatar
Lucas Schons committed
}
\arguments{
\item{html}{Url linking to a wikipedia webpage or a html formatted document.}
}
\value{
Plaintext document containing only the maintext of the give wikipedia page.
}
\description{
Clean a html formatted wikipedia page.
Lucas Schons's avatar
Lucas Schons committed
Nodes of interest from the DOM are extracted and then cleaned from all html
tags and annotations.
}