联系方式

  • QQ:99515681
  • 邮箱:99515681@qq.com
  • 工作时间:8:00-23:00
  • 微信:codinghelp

您当前位置:首页 >> Python编程Python编程

日期:2018-12-07 10:50

FINAL PROJECT: Analysis of President Trump’s Tweets and Stock Market performance.

The project will involve textual pre-processing, data cleaning, word embedding, and running regressions

in Python. There are two parts to it.

Part I requires you to clean twitter data and look for relevant words. You need to find tweets containing

the word stock (for stock market) and a second a term of your own choosing. For instance, you can look

for tweets talking about china, war, trade, unfair, great, border, refugee (anything really). Then you will

merge twitter data with stock market indices. Here also you will run some word embedding and see

what word2vec model can learn about meaning of words through Trump’s tweets.

In Part II you will run regressions on stock market indices. You need to run regressions for:

One of the dummy variables has to be a dummy for stock tweets and the second one of your choice (i.e.

dummy for tweets mentioning china, war, trade and so on). Also for your second term, you need run

additional regressions with sentiment analysis dummies (positive and negative). You will run these

regressions for 2 time periods (2010- now and 2016-now). There are 12 indices, so in total you will have

96 regressions.

There are additional details in Python script/file itself on what to do and what’s required.

For this project, I would highly recommend everyone to use Jupyter Notebook. Jupyter Notebook is

another Python code reader, but it allows you to run one line at a time, and it will be extremely useful

for this project.

Instructions to Install Jupyter Notebook for Mac Users:

1. Pip3 install jupyter notebook

Windows Users:

1. Python -m pip install jupyter notebook

Here is a tutorial/article on how to use it from CodeAcademy.

The due date for this project is midnight of December 7th. You will need to submit the following three

things:

1. Python code

2. (HTML) Output of regressions

3. Word File:

a. Interpretations of regression output

b. Limitations of these regressions

You will need to download Part I and Part II Python files (available both in Jupyter Notebook and plain

Python format) and stock market data csv.

For this assignment you can work in groups. At most there can be 3 people in a group, and you can work

on the same code and write-up. However, for each member of the group, there should be a

corresponding number of extra terms (i.e. tweets containing the word “stock” + extra terms, such as

trade, war, border, refugee). For extra terms, do not forget to do the sentiment analysis and run all the

regressions. When submitting the assignment as a group, submit the same version of the code and

write-up separately so it is easier to track who submitted what.


版权所有:留学生编程辅导网 2020 All Rights Reserved 联系方式:QQ:99515681 微信:codinghelp 电子信箱:99515681@qq.com
免责声明:本站部分内容从网络整理而来,只供参考!如有版权问题可联系本站删除。 站长地图

python代写
微信客服:codinghelp