hadoop - Twitter Data Analysis using Mahout


I have collected Twitter data using Flume for my current research project. I am thinking of extracting the texts from these FlumeData files, because I want to run Mahout text clustering on these tweets. Can anyone suggest how I would be able to do this?

So far:

  1. I have used Flume to collect the Twitter data.
  2. I parsed the data using Hive and constructed a table, tweets, consisting of the text of the tweets.
  3. hive -e 'select * from tweets' > sample.txt gives me the tweets as a text document.

I used Hive here to parse the data. Is there any other way of doing this? My concern is how to split the tweets into multiple text documents so that I can perform Mahout text clustering.
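
One possible way to do the split, as a minimal sketch rather than a definitive answer: assuming sample.txt holds exactly one tweet per line (tweets containing embedded newlines would need to be cleaned first), a short Python script can write each line to its own file. The function name split_tweets, the output directory name, and the file-name prefix are hypothetical choices made for this illustration.

```python
from pathlib import Path


def split_tweets(source, out_dir, prefix="tweet"):
    """Write each non-empty line of `source` to its own text file in `out_dir`.

    Assumes one tweet per line. Returns the number of files written.
    """
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)
    count = 0
    # errors="replace" keeps the script running if the export has bad bytes.
    with open(source, encoding="utf-8", errors="replace") as lines:
        for line in lines:
            text = line.strip()
            if not text:
                continue  # skip blank lines
            count += 1
            # Zero-padded names keep the files in their original order.
            target = out_path / f"{prefix}_{count:07d}.txt"
            target.write_text(text + "\n", encoding="utf-8")
    return count
```

Calling split_tweets("sample.txt", "tweet_docs") would produce one small document per tweet. Once that directory is copied to HDFS, Mahout's seqdirectory command can convert a directory of text files into SequenceFiles, and seq2sparse can turn those into the vectors that the clustering jobs consume.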

