Demo: question answering on HTML documents

Julien Simon
Jan 16, 2023

What if you could ask questions on HTML documents, without having to convert them to plain text first? Well, that’s exactly the purpose of the Microsoft MarkupLM: just grab a page and ask a question.

I’ve built a Hugging Face Space to let you experiment with any live URL. I also implemented multithreading to speed things up on CPU. Give it a go and let me know what you think :)

✅ Space: https://huggingface.co/spaces/juliensimon/webpage_questions
✅ Model: https://huggingface.co/microsoft/markuplm-base-finetuned-websrc
✅ Paper: https://arxiv.org/abs/2110.08518

--

--