Why not use pandoc to convert the HTML to Markdown and have the LLM condense from there?
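
Something like this is what I have in mind; a rough sketch only, assuming pandoc is installed and on your PATH. The function names and the prompt wording are made up for illustration, and the actual LLM call is left out since it depends on whatever client you use.

```python
import shutil
import subprocess


def pandoc_command(source_format="html", target_format="markdown"):
    """Build the pandoc invocation. With no file arguments, pandoc
    reads from stdin and writes to stdout."""
    # --wrap=none stops pandoc from hard-wrapping lines in the output.
    return ["pandoc", "--from", source_format, "--to", target_format, "--wrap=none"]


def html_to_markdown(html):
    """Convert an HTML string to Markdown by piping it through pandoc."""
    if shutil.which("pandoc") is None:
        raise RuntimeError("pandoc is not installed or not on PATH")
    result = subprocess.run(
        pandoc_command(),
        input=html,
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if pandoc exits non-zero
    )
    return result.stdout


def build_prompt(markdown):
    """Hypothetical prompt wrapper; pass the result to your LLM client."""
    return "Condense the following document:\n\n" + markdown
```

Usage would be roughly `build_prompt(html_to_markdown(page_html))`, with the returned string sent to the model. The point of converting first is that the model sees Markdown rather than raw tags, which is usually a lot less text to read.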