How does that work?
I found a reference to N-gram Duplicate Checking. It is based on Postgres, which I do not have in my webspace package.
I am asking because I have the following problem:
I subscribe to feeds on flightglobal.com. They are all in one category.
They re-publish content from one feed to others, so I see the same item twice, but from different feeds.
Two example GUIDs from the ttrss_feeds table:
Code: Select all
The content and content hash values are identical.
It would be easy to weed out the second post based on the id number in the GUID, in combination with title and content.
I have deactivated "allow duplicates" in the settings already.
Is duplicate detection possible per category, or across multiple feeds ?
If not, how could that be done?
Can I help?