Bayesian classifier for TTRSS

Development-related discussion, including bundled plugins
User avatar
fox
^ me reading your posts ^
Posts: 6318
Joined: 27 Aug 2005, 22:53
Location: Saint-Petersburg, Russia
Contact:

Re: Bayesian classifier for TTRSS

Postby fox » 17 Jun 2015, 18:55


nameless
Bear Rating Master
Bear Rating Master
Posts: 126
Joined: 28 Aug 2013, 20:33

Re: Bayesian classifier for TTRSS

Postby nameless » 17 Jun 2015, 19:04


User avatar
fox
^ me reading your posts ^
Posts: 6318
Joined: 27 Aug 2005, 22:53
Location: Saint-Petersburg, Russia
Contact:

Re: Bayesian classifier for TTRSS

Postby fox » 17 Jun 2015, 19:06

plugin should create the tables when it loads, strange.

nameless
Bear Rating Master
Bear Rating Master
Posts: 126
Joined: 28 Aug 2013, 20:33

Re: Bayesian classifier for TTRSS

Postby nameless » 17 Jun 2015, 19:12

it is.
can anybody confirm this as a bug or did i fuck up my postgres setup?

User avatar
fox
^ me reading your posts ^
Posts: 6318
Joined: 27 Aug 2005, 22:53
Location: Saint-Petersburg, Russia
Contact:

Re: Bayesian classifier for TTRSS

Postby fox » 17 Jun 2015, 19:23

try updating to latest trunk, it's very unlikely that something is wrong with your database

i mean you can always disable the plugin and it'll go back to normal

nameless
Bear Rating Master
Bear Rating Master
Posts: 126
Joined: 28 Aug 2013, 20:33

Re: Bayesian classifier for TTRSS

Postby nameless » 17 Jun 2015, 19:32

I just updated to latest to latest trunk things are still broken though.
Also disabling the plugin doesen't revert things back to normal, errors still occur.

User avatar
fox
^ me reading your posts ^
Posts: 6318
Joined: 27 Aug 2005, 22:53
Location: Saint-Petersburg, Russia
Contact:

Re: Bayesian classifier for TTRSS

Postby fox » 17 Jun 2015, 19:35

if you disable the plugin while updates are ongoing you are going to continue having errors until all ttrss processes terminate

i just checked on mysql too and table was created automatically btw

e: also your log doesn't show anything about plugin failing to create the table, only that relation doesn't exist, strange

nameless
Bear Rating Master
Bear Rating Master
Posts: 126
Joined: 28 Aug 2013, 20:33

Re: Bayesian classifier for TTRSS

Postby nameless » 17 Jun 2015, 19:38

so it looks as if i broke my database :(

User avatar
fox
^ me reading your posts ^
Posts: 6318
Joined: 27 Aug 2005, 22:53
Location: Saint-Petersburg, Russia
Contact:

Re: Bayesian classifier for TTRSS

Postby fox » 17 Jun 2015, 19:38

like I said it is very unlikely, i'd even say impossible

nameless
Bear Rating Master
Bear Rating Master
Posts: 126
Joined: 28 Aug 2013, 20:33

Re: Bayesian classifier for TTRSS

Postby nameless » 17 Jun 2015, 19:42

mhm.
is there anything else i can provide that could help debug this?

User avatar
fox
^ me reading your posts ^
Posts: 6318
Joined: 27 Aug 2005, 22:53
Location: Saint-Petersburg, Russia
Contact:

Re: Bayesian classifier for TTRSS

Postby fox » 17 Jun 2015, 19:43

oh wait there was a typo in the psql schema script, update and it should fix itself

e: there was another thing in there, gonna be fixed in a few minutes

not the smoothest rollout so far :)

e2: done

nameless
Bear Rating Master
Bear Rating Master
Posts: 126
Joined: 28 Aug 2013, 20:33

Re: Bayesian classifier for TTRSS

Postby nameless » 17 Jun 2015, 19:47


User avatar
fox
^ me reading your posts ^
Posts: 6318
Joined: 27 Aug 2005, 22:53
Location: Saint-Petersburg, Russia
Contact:

Re: Bayesian classifier for TTRSS

Postby fox » 17 Jun 2015, 19:49


User avatar
fox
^ me reading your posts ^
Posts: 6318
Joined: 27 Aug 2005, 22:53
Location: Saint-Petersburg, Russia
Contact:

Re: Bayesian classifier for TTRSS

Postby fox » 17 Jun 2015, 21:45

i only have a few articles in BAD category and i'm seeing it match way too much, maybe its more emphasized because there's not much data so every word counts more or whatever

maybe limiting to two categories would work better, idk

himynameschris
Bear Rating Trainee
Bear Rating Trainee
Posts: 2
Joined: 17 Jun 2015, 03:24

Re: Bayesian classifier for TTRSS

Postby himynameschris » 18 Jun 2015, 01:03

I would say that not having enough articles trained and then used to update the model is going to be the biggest reason for your results, you should try with 10 or 20 in each category.

I've also noticed that the plugin is only removing the words "'the', 'that', 'you', 'for', 'and'", this list of stopwords (words that don't carry enough meaning to be included in Bayesian classification) could be expanded to include other common words.

For example, I've noticed that some RSS feeds such as those from Ars Technica, Forbes and BBS will contain text such as "Read more" or "Continue reading", or even "this is an article about...", these alone could throw your results off. I think this would greatly improve any model based on RSS feeds.


Return to “Development”

Who is online

Users browsing this forum: No registered users and 2 guests