CSE 6250: Big Data for Health Informatics
General Information
Syllabus and schedule for the Spring 2017 offering are online here.
Dr. Sun's ungraded course labs are available here. The assignments go much deeper than the labs do, but the labs are a useful resource for setting up your environment(s) and getting your feet wet with the technologies you'll learn during the course.
Resources
- Learn Scala in Y minutes (Scala crash course): https://learnxinyminutes.com/docs/scala/
- One-hour Scala tutorial: https://www.youtube.com/watch?v=DzFt0YkZo8M
- Just Enough Scala for Spark (how to use Scala in Spark): https://www.youtube.com/watch?v=LBoSgiLV_NQ
- Spark RDD API examples: http://homepage.cs.latrobe.edu.au/zhe/ZhenHeSparkRDDAPIExamples.html
- IDE: The Scala plugin for IntelliJ IDEA is a very popular choice for syntax completion and type resolution: https://confluence.jetbrains.com/display/SCA/Scala+Plugin+for+IntelliJ+IDEA
Prep Strategy/Tips
- Complete the labs on your own: Just setting up the environment for each lab beforehand will save you a lot of time when you have to work on the assignments.
- Don't just copy/paste the source from the labs; try to solve the exercises on your own, and/or try to understand what each line of code does.
- You don't really need to complete a 3-4 month course in Scala to succeed in this class. If you understand each line of code in the first four labs on this page, you will be fine: http://www.sunlab.org/teaching/cse6250/spring2017/lab/spark/