
Registered since September 28th, 2017
Has a total of 4246 bookmarks.
Tag selected: categorization.
Saved by uncleflo on December 23rd, 2018.
I am working on text classification using SVM. In a paper (Fuzzy Support vector machine for multi-class text categorization) the author reduces the features (words) by applying the following criteria: "Eliminate the words that are ICF>log2, Uni<0.2 and TF_IDF<26". My question is: how can we find the TF_IDF value of a word? TF is a local measure and IDF is a global measure, so TF_IDF gives a different value for a word in each document.

TF-IDF is the acronym for Term Frequency–Inverse Document Frequency. This metric aims to estimate how important a keyword is not only in a particular document but across a whole collection of documents (corpus). A lot of common words, like articles or conjunctions, may appear several times in a document, but they are not relevant as key concepts to be indexed or searched. TF (Term Frequency) measures how frequently a term occurs in a document.
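
To make the local/global distinction concrete, here is a minimal sketch of one common TF-IDF variant: TF as the term's count divided by the document length, and IDF as the natural log of (number of documents / number of documents containing the term). The function name tf_idf and the toy corpus are illustrative assumptions, and the paper quoted above may well use a different weighting or scaling, so the numbers here are not comparable to its TF_IDF<26 threshold.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: score} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = ln(N / df).
    Other variants (raw counts, smoothed idf, log base 2) are also common.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears (global).
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)  # term counts within this document (local)
        total = len(doc)
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores

# Hypothetical toy corpus, already tokenized on whitespace.
docs = [
    "the cat sat on the mat".split(),
    "the dog chased the cat".split(),
    "the bird sang".split(),
]
scores = tf_idf(docs)
```

With this variant, a word that occurs in every document (here "the") scores zero everywhere, while a word such as "cat" gets a different score in each document it appears in, which is why a single per-word TF-IDF value requires some aggregation choice (for example the maximum or the sum over documents).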
tf-idf question solution answer vector machine text categorization keyword estimate article lookup development server document