Language-Independent Features
Word and N-Gram Counting
The bagOfWords
and bagOfNgrams
functions support tokenizedDocument
input regardless of language. If you have a tokenizedDocument
array containing your data, then you can use these functions.
Modeling and Prediction
The fitlda
and fitlsa
functions support bagOfWords
and bagOfNgrams
input regardless of language. If you have a bagOfWords
or bagOfNgrams
object containing your data, then you can use these functions.
The trainWordEmbedding
function supports tokenizedDocument
or file input regardless of language. If you have a tokenizedDocument
array or a file containing your data in the correct format, then you can use this function.
See Also
stopWords
| removeWords
| normalizeWords
| bagOfWords
| bagOfNgrams
| tokenizedDocument
| fitlda
| fitlsa
| wordcloud
| addSentenceDetails
| addLanguageDetails