by chelya on Sun Apr 08, 2007 12:58 pm
Dan, Thank you for your reply. I am still not convinced we are on the same page about what I am talking about.
You are right - there are no duplicates when feed is downloaded within that particular download. However the following (correct, but undesired) behavior happens:
Let's say rss reader is pulling the feed every hour.
1. 10am - get all the messages (let's say 30 messages are downloaded)
2. 11am - reader will get 30 latest messages correctly detecting and not showing the oldest 5 because those topic where not updated within that hour and messages are the same. Then it shows 20 messages that were updates to the topics already downloaded on step 1 since new messages were posted to those topics. Granted it highlights new messages, but on the topics already downloaded in step 1. The latest 5 are new topics that were created within last hour. As a result I am getting 25 new messages in the reader. If I happened to not check the reader since 10am - I will get 55 new messages overall in the reader and 25 will be the duplicates (means the same topic, but different posts within).
What I would rather have:
1. 10am - get all the topics (or threads) (let's say 30 messages are downloaded)
2. 11am - reader will get 30 latest topics correctly detecting and not showing the oldest 25 because those are not new topics even though the messages were posted to them - we are downloading topics, not messages. Then the rss reader would only show 5 newest entries representing new topics appearing after 10am.
Dan, I hope I am not wasting your time. I have a lot of appreciation for what you do and I understand what kind of effort it takes. Either way - whether you decide to look into it or not - thank you.