
Date: 2018-12-12 09:39

Project - Predicting Airline Delays with Hadoop

One of the main goals of this project is to use machine learning algorithms to build predictive models with Python packages and data analysis programs. Training on the original datasets is important for building the models and evaluating their performance. Finding a good combination of technologies and programming languages is crucial to a successful project.

Dataset

The data can be downloaded from the Bureau of Transportation Statistics, where it is described in detail. A second link, listed after the first, gives more detailed information.

Bureau of Transportation Statistics:

https://www.transtats.bts.gov/OT_Delay/OT_DelayCause1.asp

Detail: https://www.transtats.bts.gov/Fields.asp?Table_ID=236

Possible tools

- Apache Pig
- Hadoop
- Python
- scikit-learn
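
As one illustration of how these tools might fit together, the sketch below mimics a Hadoop Streaming style map and reduce step in plain Python to compute a per-carrier delay rate, which could then serve as an input feature for a scikit-learn model. The column names Carrier and ArrDelay, the 15-minute delay threshold, and the inline sample rows are assumptions made for illustration only; they are not taken from the assignment, and the real field names should be checked against the dataset description.

```python
import csv
import io
from itertools import groupby


def mapper(lines):
    """Emit (carrier, 1 if delayed else 0) for each CSV record."""
    for row in csv.DictReader(lines):
        try:
            delay = float(row["ArrDelay"])  # assumed column name
        except (KeyError, TypeError, ValueError):
            continue  # skip rows with a missing or non-numeric delay
        # Assumption: a flight counts as delayed at 15 minutes or more.
        yield row["Carrier"], 1 if delay >= 15 else 0


def reducer(pairs):
    """Aggregate (carrier, flag) pairs into a delay rate per carrier."""
    # In a real Hadoop job the framework sorts by key between the map
    # and reduce phases; sorted() stands in for that shuffle here.
    for carrier, group in groupby(sorted(pairs), key=lambda kv: kv[0]):
        flags = [flag for _, flag in group]
        yield carrier, sum(flags) / len(flags)


# Hypothetical sample rows, not real BTS data.
SAMPLE = """Carrier,ArrDelay
AA,20
AA,-3
DL,5
DL,
UA,45
"""

rates = dict(reducer(mapper(io.StringIO(SAMPLE))))
print(rates)
```

On a cluster, the mapper and reducer would normally be separate scripts reading from standard input; they are kept in one file here so the data flow can be checked locally before scaling up.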

Report

The report should briefly cover the following topics:

- Problem Definition: What problem are you trying to solve? What are the challenges of this problem?
- Methodology: What is your methodology for attacking the problem and its associated challenges? What are the computational and space complexities of your solution in terms of input size?
- Results and Discussion: What are the outcomes of the project?
- Guideline: Briefly explain which code was used for which task.

Note that your report should not exceed 8 pages.

