hadoop - Twitter Data Analysis using mahout -
i have collected twitter data using flume current research project. thinking of extracting texts these flumedata files. want mahout text clustering on these tweets. can suggest me how able ?
so far ,
- i have used flume collect twitter data
- i parsed data using hive , constructed table tweets consisting of text tweets.
hive -e 'select * tweets' > sample.txt, gives me tweets text document.
i used hive here parse data .. there other way of doing this? cos concern split tweets multiple text documents can perform mahout text clustering.
Comments
Post a Comment