filmrezension.de

300 movie reviews scraped from German movie review site. 

Original files downloaded from site are in filmrezension.de_raw

The files have been formatted to look like Pang / Lee data; the output of that process are in filmrezension.de_lines

To go from raw to that format:

python raw_to_lines.py "filmrezension.de_lines/*.*" 