- I really want to brainstorm this one with the people here on HN.
- I am trying to do something along these lines:
- What I am looking for is "informationally dense" articles across a blog
- For example, posts like "AI is going to do this" and "how I felt at my company when they adopted AI" are pure opinion.
- On the other hand, a post like "here are 10 ways to loop a directory in bash" is informationally dense.
- What sort of techniques / algorithms do you think I could use to narrow it down? I can think of removing stop words from the post and computing the ratio of remaining words to total words (not sure if that means anything), or maybe n-gram analysis, but I am really not an expert at this.
- Perhaps someone at HN can shed some light on how to go about identifying "information rich" articles on a blog.
- Do you think LLMs would do a good job if we were to loop through every post on a blog and ask them to pick the non-opinion ones?
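
To make the stop-word idea above concrete, here is a minimal Python sketch of it: strip stop words, then take the ratio of remaining words to total words per post. The tiny `STOP_WORDS` list, the tokenizer regex, and the function names `content_ratio` and `rank_posts` are all my own assumptions for illustration, and I have no evidence yet that this ratio actually tracks "informational density"; it is only a baseline to compare other ideas against.

```python
import re

# Illustrative stop-word list (an assumption; a real list would be much longer).
STOP_WORDS = frozenset(
    """a an the and or but if of to in on at for with by from as
    is are was were be been i me my we our you your he she it they them
    this that these those do does did have has had will would can could
    should not so just very really how what when where""".split()
)

# Lowercase alphanumeric tokens; apostrophes are kept so "don't" stays one token.
TOKEN_RE = re.compile(r"[a-z0-9']+")


def tokenize(text):
    """Split text into lowercase word tokens."""
    return TOKEN_RE.findall(text.lower())


def content_ratio(text):
    """Return the fraction of tokens that are not stop words (0.0 for empty text)."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    content = [t for t in tokens if t not in STOP_WORDS]
    return len(content) / len(tokens)


def rank_posts(posts):
    """Given {title: body}, return (title, ratio) pairs sorted from highest ratio to lowest."""
    scored = [(title, content_ratio(body)) for title, body in posts.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical sample posts, made up for this example.
    posts = {
        "opinion": "I really feel that this is just how it was for me",
        "bash": "for f in *; do echo $f; done",
    }
    for title, score in rank_posts(posts):
        print(f"{title}: {score:.2f}")
```

A score like this would probably be most useful for ranking posts within a single blog rather than as an absolute threshold, since writing style shifts the ratio a lot from author to author.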