Monday, April 29, 2013

mecab buffer error


Today, I try the Mecab to deal with Japanese word segmentation problem.
But the buffer-errors comes out.
input-buffer overflow. The line is split. use -b #SIZE option.
we can set a bigger buffer to fixed this problem. 
for example, the default size is 8192, we can change it to 8192 * 5
mecab -b 40960 < data

Friday, April 19, 2013

Best paper VS Top Cited Papers in CS


I found a very interesting web sits. It compares the best paper and top cited papers in computer science conference.


Saturday, April 13, 2013

Fix the problem "tar: Failed to set default locale" in Mac OS X

write the following in terminal

defaults write org.R-project.R force.LANG en_US.UTF-8

and restart R

from:
http://davidprakash.blogspot.jp/2011/05/r-error-tar-failed-to-set-default.html