Posts

Showing posts from March, 2016

Open Data - Hong Kong Air Quality & SO2 Concentration

Recently, I attended the Open Data Hackathon 2016 in Hong Kong and joined a team of people from different backgrounds. It was a great opportunity to learn from experienced practitioners: how they process data with different exploration techniques, and how they visualize and present analytic results. The most valuable part was learning a new technique, autoregression, which applies to time series data. Previously, I had mostly used multiple linear regression on non-time-series data to investigate which factors affect a dependent variable. Our team focused on air pollution in Hong Kong. We wanted to find out how pollutant levels depend on policy, emission sources and the environment, so that we could propose improvements or policies to alleviate the problem. With limited time and resources, we aimed to finish within one day. To explore the data, we selected one air pollutant, sulphur dioxide (SO2), to study. Ho...
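To make the idea of autoregression concrete, here is a minimal sketch of a first-order model, AR(1), where each value is predicted from the one before it: x[t] = c + phi * x[t-1]. This is not the hackathon team's code. The function names and the toy series are invented for illustration, and the numbers are not real SO2 readings.

```python
def fit_ar1(series):
    """Least-squares fit of x[t] = c + phi * x[t-1] (hypothetical helper)."""
    x_prev = series[:-1]   # values at time t-1
    x_next = series[1:]    # values at time t
    n = len(x_prev)
    mean_prev = sum(x_prev) / n
    mean_next = sum(x_next) / n
    cov = sum((a - mean_prev) * (b - mean_next) for a, b in zip(x_prev, x_next))
    var = sum((a - mean_prev) ** 2 for a in x_prev)
    phi = cov / var                   # slope: weight of the previous value
    c = mean_next - phi * mean_prev   # intercept
    return c, phi


def forecast_next(series, c, phi):
    """One-step-ahead forecast from the last observed value."""
    return c + phi * series[-1]


# Made-up series generated exactly by x[t] = 2 + 0.5 * x[t-1]; NOT real data.
toy = [10.0]
for _ in range(9):
    toy.append(2.0 + 0.5 * toy[-1])

c, phi = fit_ar1(toy)
print("c =", c, "phi =", phi, "next =", forecast_next(toy, c, phi))
```

Unlike multiple linear regression on unrelated observations, the predictor here is the series' own past, which is why the ordering of the data matters.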

How Girls See Guys, and How Guys See Girls?

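Recently, I saw a decision tree in a post on the Durex page (disclosure: I was not paid for advertising, though payment would be welcome, ha). At first I just laughed it off, but on second thought, judging by intuition, the design of the tree seemed fairly reasonable. Then inspiration struck: if I built a model from data, would the result be the same? This silly idea drove me to do something even sillier, namely to build a classification tree LOL, and see whether my friends' views are close to what the picture shows~ If you lack the courage to ask, but want to know how your dream girl or dream guy feels about you, this tree should help, because it can roughly predict whether they like you!

I happen to be learning R, so this time the analysis and modeling are left to R as practice~ Sorry, Python~_~

Source: Durex Facebook page

First, I made my goal and direction clear.

Target variable: feeling toward the other person, with the following 3 values:
$y_1$: likes them (like)
$y_2$: does not like them, but thinks they are a nice person (nice person)
$y_3$: does not like them at all (BYE~)

Input variables:
$x_1$: Appearance (handsome vs. not handsome, pretty vs. not pretty)
$x_2$: Chatable (easy to talk to vs. hard to talk to)

To get data, the most direct way is to run interviews, and the interviewees were of course my own friends (luckily no friendships were harmed). I asked how they feel about their own friends, Mr. X / Ms. Y.

If you just want the results, scroll straight to ===Divider===~

So I designed a few questions to grill, er, interview several of my friends:
1a. Do you find Mr. X / Ms. Y's appearance attractive?     Yes/No
1b. Do you think Mr. X / Ms. Y is good-looking?     Yes/No
1c. Do you think Mr. X / Ms. Y has a good figure?     Yes/No
2a. Are you and Mr. X / Ms. Y easy to talk to each other?     Yes/No
2b. Would you want to keep ... with Mr. X / Ms. Y...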
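The setup above (two yes/no inputs, three possible labels) can be sketched as a tiny classification tree that splits on whichever feature gives the lowest Gini impurity. The post does its modeling in R; this Python version is only an illustration of the technique. The feature names, label strings and all rows below are invented, and they are not the survey answers collected for the post.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((k / n) ** 2 for k in Counter(labels).values())


def split_impurity(rows, feature):
    """Weighted Gini impurity after splitting rows on a 0/1 feature."""
    total = len(rows)
    score = 0.0
    for value in (0, 1):
        subset = [label for feats, label in rows if feats[feature] == value]
        score += len(subset) / total * gini(subset)
    return score


def build_tree(rows, features):
    """Return a leaf label (str) or a node (feature, {0: child, 1: child})."""
    labels = [label for _, label in rows]
    majority = Counter(labels).most_common(1)[0][0]
    if not features or gini(labels) == 0.0:
        return majority
    best = min(features, key=lambda f: split_impurity(rows, f))
    rest = [f for f in features if f != best]
    children = {}
    for value in (0, 1):
        subset = [(feats, label) for feats, label in rows if feats[best] == value]
        children[value] = build_tree(subset, rest) if subset else majority
    return (best, children)


def predict(tree, feats):
    """Walk from the root to a leaf and return its label."""
    while isinstance(tree, tuple):
        feature, children = tree
        tree = children[feats[feature]]
    return tree


# Invented toy answers (1 = yes, 0 = no); NOT the post's survey data.
rows = (
    [({"appearance": 1, "chatable": 1}, "like")] * 3
    + [({"appearance": 1, "chatable": 0}, "nice person")] * 2
    + [({"appearance": 0, "chatable": 1}, "nice person")] * 2
    + [({"appearance": 0, "chatable": 0}, "bye")] * 3
)

tree = build_tree(rows, ["appearance", "chatable"])
print(predict(tree, {"appearance": 1, "chatable": 0}))
```

With only two binary inputs the tree can have at most four leaves, so it can be read directly as a small table of outcomes.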