2
Course Objectives
3
Grading Policy
4
Grading Policy
5
Homework (320 points)
§ Other excuses …
6
Midterm & Final Exams
7
Final Group Project (360 points)
• Identify a machine learning problem. The instructor will provide a list of projects
from Kaggle.com.
• Conduct a literature review to find existing solutions to your problem (e.g., by
other teams on Kaggle) and understand the methods they used.
• Select your machine learning algorithm(s) for the problem, implement and
optimize them, upload your submission to Kaggle, and obtain your Kaggle ranking.
• Your final report shall include: 1) introduction: the statement of the problem; 2)
related work: existing solutions to this problem; 3) your solution: i) a description
of the dataset, ii) the machine learning algorithm(s) you used and why you
chose them, iii) the implementation process, including how you tuned the
parameters to get the best results; 4) a comparison of your results with those of
top-ranking Kaggle teams in terms of accuracy, computational cost, and other
advantages/disadvantages; 5) future research directions to improve your
algorithm; 6) conclusion; and 7) references.
• THREE (3) members in each group. The instructor will assign you to a team if you
cannot find one.
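The report asks how the parameters were tuned. One common approach is a grid search scored by cross-validation, sketched below using only the Python standard library. Everything here is an illustrative assumption rather than a course requirement: the synthetic two-class data, the choice of k-nearest neighbors as the model, the candidate values of k, and the helper names (`knn_predict`, `cross_val_accuracy`, `make_toy_data`) are all hypothetical.

```python
import random
from collections import Counter


def knn_predict(train, query, k):
    """Predict the majority label among the k training points nearest to query."""
    nearest = sorted(
        train,
        key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], query)),
    )[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def cross_val_accuracy(data, k, folds=5):
    """Mean accuracy of k-NN over `folds` held-out splits of the data."""
    scores = []
    for i in range(folds):
        held_out = data[i::folds]  # every folds-th row, starting at row i
        train = [row for j, row in enumerate(data) if j % folds != i]
        correct = sum(knn_predict(train, x, k) == y for x, y in held_out)
        scores.append(correct / len(held_out))
    return sum(scores) / folds


def make_toy_data(n=200, seed=0):
    """Synthetic stand-in for a real dataset: two overlapping 2-D Gaussian classes."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        label = rng.randint(0, 1)
        center = 2.0 * label
        data.append(((rng.gauss(center, 1.0), rng.gauss(center, 1.0)), label))
    return data


data = make_toy_data()
# Grid search: score every candidate k and keep the one with the best mean accuracy.
results = {k: cross_val_accuracy(data, k) for k in (1, 3, 5, 9, 15)}
best_k = max(results, key=results.get)
print(f"best k = {best_k}, cross-validated accuracy = {results[best_k]:.3f}")
```

In a real project the same loop would run over your own model and its hyperparameters; recording the full `results` table, not only the winner, makes the tuning discussion in the report easier to write.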
8
Final Group Project (360 points)
Deliverables:
1. Proposal (20 points): one-page PDF file including
1) Problem statement
2) Description of data set
3) Implementation plan
4) Team members & task allocation
9
Final Group Project (360 points)
Deliverables:
4. Final report (240 points): ≥ 6 pages in a PDF file
1) Shall contain all seven sections (see slide # 8)
2) Written using the IEEE format template (available in Canvas)
Editing tool: LaTeX (most widely used in academia; learn to use it)
3) Source code: made available at Kaggle.com
5. Final presentation video (60 points)
1) Up to 12 minutes per team
2) Slides shall be concise yet sufficiently detailed
Final Report Grading Criteria:
a) PDF report (60 points): based mainly on the quality of writing, e.g., it shall contain all
seven sections, provide sufficient discussion, and be written and organized
professionally.
b) Overall quality (180 points): based mainly on your ranking at Kaggle.com.
top 10%: 180 points; top 10-20%: 170 points; top 20-30%: 160 points;
top 30-40%: 150 points; top 40-50%: 140 points; top 50-60%: 130 points;
top 60-70%: 120 points; top 70-80%: 110 points; top 80-90%: 100 points.
other: based on actual code quality, but no more than 100 points.
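The ranking bands above follow a simple pattern: 180 points for the top 10%, minus 10 points for each further 10% band down to 100 points at 80-90%. A minimal sketch of that mapping follows. The function name `kaggle_quality_points` is hypothetical, and the treatment of boundaries (a team exactly at 10% counts as top 10%) is an assumption, since the slide does not say which band a boundary rank falls into.

```python
from typing import Optional


def kaggle_quality_points(rank: int, total_teams: int) -> Optional[int]:
    """Map a leaderboard rank to the 'overall quality' points from the rubric.

    Returns None for teams outside the top 90%, whose score is instead
    based on code quality and capped at 100 points.
    Assumption: band upper bounds are inclusive (rank/total == 10% earns 180).
    """
    if not 1 <= rank <= total_teams:
        raise ValueError("rank must be between 1 and total_teams")
    # Integer arithmetic avoids floating-point trouble at band boundaries:
    # band 0 is the top 10%, band 1 is 10-20%, ..., band 8 is 80-90%.
    band = (10 * rank - 1) // total_teams
    if band >= 9:
        return None
    return 180 - 10 * band


print(kaggle_quality_points(5, 100))   # top 10% band
print(kaggle_quality_points(37, 100))  # 30-40% band
print(kaggle_quality_points(95, 100))  # outside the top 90%
```

Adding the writing score of up to 60 points to the quality score of up to 180 points gives the 240 points allotted to the final report.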
10
Tentative Schedule (see syllabus)
11
Machine Learning Datasets
• UCI Repository:
o http://www.ics.uci.edu/~mlearn/MLRepository.html
o http://kdd.ics.uci.edu/summary.data.application.html
o http://yann.lecun.com/exdb/mnist/
• Face Databases (a good collection of various face databases)
o http://web.mit.edu/emeyers/www/face_databases.html#ar
• Statlib:
o http://lib.stat.cmu.edu/
• Delve:
o http://www.cs.utoronto.ca/~delve/
• Kaggle (business applications)
o https://www.kaggle.com/
12
Related Journals and Conferences
• Journals:
– Journal of Machine Learning Research
– Machine Learning
– IEEE Transactions on Pattern Analysis and Machine Intelligence
– IEEE Transactions on Neural Networks
– Neural Computation
– Neural Networks
– ...
• Conferences:
– International Conference on Machine Learning (ICML)
– The IEEE International Conference on Data Mining (ICDM)
– Neural Information Processing Systems (NeurIPS, formerly NIPS)
– International Conference on Knowledge Discovery and Data
Mining (KDD)
– ...
13
Machine Learning Software
• Python
o Default language for programming assignments & project
• Matlab
• R/S-PLUS
• Java
• SAS
14
Ethics Statement
All Stevens graduate students promise to be fully truthful and avoid dishonesty, fraud,
misrepresentation, and deceit of any type in relation to their academic work. A student’s
submission of work for academic credit indicates that the work is the student's own. All outside
assistance must be acknowledged. Any student who violates this code or who knowingly
assists another student in violating this code shall be subject to discipline.
All graduate students are bound to the Graduate Student Code of Academic Integrity by
enrollment in graduate coursework at Stevens. It is the responsibility of each graduate student
to understand and adhere to the Graduate Student Code of Academic Integrity. More
information including types of violations, the process for handling perceived violations, and
types of sanctions can be found at www.stevens.edu/provost/graduate-academics.
15
Ethics Statement
16