2010年12月11日星期六

Statistics / Machine Learning

Recently I subscribed a blog owned by Brendan O'Connor, a student of CMU's Eric.P.Xing.
Quite many suggestive articles came to me with detailed and subjective discussions. I felt common sense when it occurs the relationship between machine learning and statistics. In fact I'm having a class named Elements of Statistical Learning given by my supervisor and benefits a lot.
As what was listed in the comparison, O'Connor declare ML guys are much more(unfairly) lucky. However, aren't statisticians more welcomed by industries except IT?
Anyway, statistics is far beyond only distribution, mean, variance and toy probability.

Another recent issues is tools for research. Choices listed below :
Python : agile and save-time coding, a real programming language s.t. easy to combine several tasks, hadoop interface; immature libraries (possibly make codes do now work)
Matlab : most popular in ML(easy to find reference codes and interfaces) and vision(important for me), wonderful debugging; weak in presenting logic, slow execution(seems tricky to speedup)
Mathematica/Maple : brilliant symbolic derivation
R : most popular in statistics, better plotting(visualization); steep learning curve, syntax with unfamiliar philosophy(so might often forget some basic usages), poor GUI
C : hard and annoying to prototype one's idea
Java : full of redundancy

O'Connor gives positive comment on all languages. Comments are funny and well designed.

I'd think my choice will be Matlab for numerical and vision related computing while Maple for formula derivation : MM combination ! :-)

2010年9月10日星期五

敢更土一点吗

今天试图挖掘一点繁体常用词,看了一天同胞们做的网页
不知是年代久远,还是同胞做教育和语言学术的人不善美工,亦或繁体字体不妥,若干涉及网站的制作均是土的掉渣,无论排版、配色、表单表示等,尤其是排版,脑海中浮现danni同我描述的香港沿街巷的店面招牌——恨不得把店里有什么都写在上头。
对比前些日子阅读的2010年Web 设计风(上)2010年Web设计风(下)数据之美(九):50个精美绝伦的Infographics,这真是别样风情...下附四图样:
http://zhongwen.com/zi.htm
http://www.mandarintools.com/
http://www.edu.tw/files/site_content/M0001/86news/know.htm?open
http://203.64.42.21/iug/ungian/SoannTeng/chil/taihoa.asp

2010年7月7日星期三

Graduating Season

XixiuOuyang(me), LinjieSong, XinruoSun

DanniWang()

2010年5月30日星期日

Google against GFW


一张也许很old的图...

2010年1月25日星期一

开始墙外生活

刚收到室友邮件,内容如下,部分内容被“墙” :P
---------------------------------------------
已经买好,$5.99每月

ip:敏感词
ipv6:正在申请
先配了一个ssh代理
用户名:敏感词
密码:敏感词

windows的设置参考http://www.williamlong.info/archives/2057.html
linux下用 ssh -qTNn -D localhost:port 敏感词@敏感词(其中port 是某个端口号,如8000)
然后把浏览器里代理服务器的地址设置成localhost:port即可,代理类型为SOCKS5

2010年1月18日星期一

2010年1月16日星期六

和谐地图


click to get picture