
I need a tool to classify articles based on a short category text that consists of two or three words separated by '-'. The content of the RSS/XML tag is, for example:

Foreign - News

Football - Foreign

I created my own categories in the DB, and now I need to map the categories parsed from this news source's RSS feed onto the news categories I defined.

For example, I would need every article whose category contains "football" to be identified as category Sport. Sometimes, however, the category tag is an exact match: Foreign - News should go into the category I defined in the DB as Foreign.

So far, the only AI technique I have used is trained decision trees, from a framework, for another project. I would therefore like advice on an approach, technique, or particular framework, probably AI-based, that I can use to solve this problem. I am not very experienced in AI and don't want to end up in a dead end because of a poor decision of my own.

While it could be solved with many ifs and a 'contains' function, that does not seem like a very good solution to me.
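
To make the requirement concrete, here is a minimal sketch of what a data-driven version of the "ifs and contains" idea might look like. The names `EXACT_RULES`, `KEYWORD_RULES`, `normalize`, `classify`, and the "Uncategorized" fallback are hypothetical and invented only for illustration; only the Foreign and Sport mappings come from the examples above.

```python
# Hypothetical rule tables. Only the two mappings below come from the question;
# a real table would presumably be loaded from the DB.
EXACT_RULES = {
    ("foreign", "news"): "Foreign",   # whole tag must match exactly
}
KEYWORD_RULES = [
    ("football", "Sport"),            # any tag containing this word
]
DEFAULT_CATEGORY = "Uncategorized"    # assumed fallback, not from the question


def normalize(raw):
    """Split a tag such as 'Football - Foreign' into lowercase words."""
    return tuple(part.strip().lower() for part in raw.split("-") if part.strip())


def classify(raw):
    """Return my own category for a raw RSS category tag."""
    tokens = normalize(raw)
    # 1. An exact match on the whole tag wins.
    if tokens in EXACT_RULES:
        return EXACT_RULES[tokens]
    # 2. Otherwise the first keyword rule whose word appears in the tag applies.
    for keyword, category in KEYWORD_RULES:
        if keyword in tokens:
            return category
    # 3. Nothing matched.
    return DEFAULT_CATEGORY


if __name__ == "__main__":
    for tag in ("Foreign - News", "Football - Foreign", "Weather - Local"):
        print(tag, "->", classify(tag))
```

This keeps the rules in data rather than in nested ifs, but it still has to be maintained by hand for every new tag, which is the part that feels inflexible.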

TL;DR: I basically need something like a "clever, flexible and universal if-elseif".

NOTE: I can also use the article description text if necessary, but it seems to me that the category text described above is unambiguous enough for this kind of problem.
