project:chparlscraping



In this project, we plan to:

  1. Scrape the parliament website to retrieve:
    1. councilors' biographies
    2. minutes
  2. Structure them:
    1. session
    2. intervention (with rank/order)
    3. author
    4. text
  3. Analyze them:
    1. person vs. vocabulary
    2. dialogue order
    3. gender
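
The structure in step 2 and two of the analyses in step 3 could be sketched as follows. This is only a minimal illustration, not the project's actual code: the class names, field names, helper functions and sample interventions are all hypothetical placeholders, and no real minutes are used.

```python
from collections import Counter
from dataclasses import dataclass, field
import re


@dataclass
class Intervention:
    rank: int    # order of the intervention within its session
    author: str  # name of the speaker (hypothetical field name)
    text: str    # what was said


@dataclass
class Session:
    title: str  # hypothetical identifier for the session
    interventions: list = field(default_factory=list)


def vocabulary_by_author(session):
    """Count lowercase word occurrences for each author in a session."""
    counts = {}
    for item in session.interventions:
        words = re.findall(r"\w+", item.text.lower())
        counts.setdefault(item.author, Counter()).update(words)
    return counts


def dialogue_order(session):
    """Return the author names in speaking order (sorted by rank)."""
    ordered = sorted(session.interventions, key=lambda item: item.rank)
    return [item.author for item in ordered]


# Invented sample data, for illustration only.
session = Session(
    title="example session",
    interventions=[
        Intervention(rank=2, author="B", text="I agree with the proposal"),
        Intervention(rank=1, author="A", text="The proposal is ready"),
        Intervention(rank=3, author="A", text="Thank you"),
    ],
)

vocabulary = vocabulary_by_author(session)
order = dialogue_order(session)
```

Keeping the rank on each intervention means the dialogue order can be recovered even if the scraped interventions are stored out of sequence.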

Our GitHub repository:

  • Soon!
  • Last modified: 2015/09/04 12:25
  • by yrochat