Reddit reviews: Text Analysis with R for Students of Literature (Quantitative Methods in the Humanities and Social Sciences)

1 Reddit comment about Text Analysis with R for Students of Literature (Quantitative Methods in the Humanities and Social Sciences):

u/sazken · 2 points · r/GetStudying

Yo, I'm not getting that image, but at a base level I can tell you this -

  1. I don't know if you know any R or Python, but there are good NLP (Natural Language Processing) libraries available for both

    Here's a good book for Python: http://www.nltk.org/book/

    A link to some more: http://nlp.stanford.edu/~manning/courses/DigitalHumanities/DH2011-Manning.pdf

    And for R, there's http://www.springer.com/us/book/9783319207018
    and
    https://www.amazon.com/Analysis-Students-Literature-Quantitative-Humanities-ebook/dp/B00PUM0DAA/ref=sr_1_9?ie=UTF8&qid=1483316118&sr=8-9&keywords=humanities+r

    There's also this https://www.amazon.com/Mining-Social-Web-Facebook-LinkedIn/dp/1449367615/ref=asap_bc?ie=UTF8 for web scraping with Python

    I know the R context better, and using R, you'd want to do something like this:

  2. Scrape a bunch of sites using the R library 'rvest'
  3. Put everything into a 'Corpus' using the 'tm' library
  4. Use some form of clustering or topic modeling (k-means, LDA, or a Structural Topic Model, using base R's 'kmeans' function or the 'lda' or 'stm' libraries respectively) to draw out trends in the data
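
To make steps 2 to 4 concrete, here's a rough sketch of the same idea in plain Python rather than R. It is not rvest, tm, or stm: it uses only the standard library, it skips the actual downloading (the `pages` strings are made-up stand-ins for scraped HTML), and it swaps in a tiny hand-rolled k-means instead of LDA or a Structural Topic Model. All function names here are hypothetical, just for illustration.

```python
import math
import re
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text from an HTML string, skipping script/style."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.parts.append(data)


def html_to_text(html):
    """Step 2 stand-in: strip the markup from one page."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def build_vectors(docs):
    """Step 3 stand-in: turn a corpus into unit-length word-count vectors."""
    counts = [Counter(re.findall(r"[a-z]+", d.lower())) for d in docs]
    vocab = sorted(set().union(*counts))
    vectors = []
    for c in counts:
        v = [c[w] for w in vocab]
        norm = math.sqrt(sum(x * x for x in v)) or 1.0
        vectors.append([x / norm for x in v])
    return vocab, vectors


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, iterations=20):
    """Step 4 stand-in: k-means with deterministic farthest-first seeding."""
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        far = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(list(far))
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assign every document to its nearest centroid.
        labels = [min(range(k), key=lambda j: sq_dist(v, centroids[j]))
                  for v in vectors]
        # Move each centroid to the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Made-up pages standing in for whatever you would actually scrape.
pages = [
    "<html><body><h1>Cats</h1><p>cats purr and cats nap</p></body></html>",
    "<html><body><p>kittens and cats purr</p><script>var x;</script></body></html>",
    "<html><body><p>stocks fell as markets dropped</p></body></html>",
    "<html><body><p>markets rallied and stocks rose</p></body></html>",
]
corpus = [html_to_text(p) for p in pages]
vocab, vectors = build_vectors(corpus)
labels = kmeans(vectors, k=2)
print(labels)  # one cluster id per page
```

On real data you'd want to drop stopwords and probably weight terms (e.g. TF-IDF) before clustering, which is the kind of thing the libraries above handle for you.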

    And that's that!