Extracts plain text from Markdown strings
Tokenizes an HTML string, extracting plain text while ignoring HTML tags