Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
forecasting:start [2020/09/19 21:08] jclaudio [Goals] |
forecasting:start [2021/09/19 21:59] (current) |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== Forecasting ====== | ====== Forecasting ====== | ||
- | //Updated: 1/19/2016// | + | //Updated: Fall 2020// |
- | Forecasting focuses on creating a statistical model from data and then using it to predict future trends based on the test data. It is considered a field of data science/statistics. There is also a large craze concerning machine learning recently--the purpose of this team is to learn to adapt statistical and machine learning analysis techniques to weather-related data. | + | Forecasting focuses on creating a statistical model from existing data and then using it to predict future trends based on the test data. The purpose of this team is to take meteorological data from the weatherboxes and apply **machine learning** algorithms to predict future weather conditions across the Manoa campus. |
- | + | ||
- | ---- | + | |
+ | {{ :forecasting:forecastingapproach.png?500 |}} | ||
===== Introduction ===== | ===== Introduction ===== | ||
Line 15: | Line 14: | ||
===== History ===== | ===== History ===== | ||
+ | |||
+ | *Fall 2020 Edit* | ||
The SCEL Forecasting team was started in the Fall 2015 semester, headed by Jeremy Garcia under Dr. Anthony Kuh. In the prior months, a few people from the University of California, Santa Cruz, came down to give a presentation about the new iPython (and recently renamed then, Jupyter) platform. They expressed that the University of California system had invested quite heavily into the platform as an accessible, flexible, powerful, and convenient platform to teach analysis alongside the modern, Python programming language. Motivated to demo out this new platform for analysis, Dr. Kuh wanted the team to explore this platform and, with it, begin to learn more about machine learning, as it was a hot topic along with "big data" at the time. | The SCEL Forecasting team was started in the Fall 2015 semester, headed by Jeremy Garcia under Dr. Anthony Kuh. In the prior months, a few people from the University of California, Santa Cruz, came down to give a presentation about the new iPython (and recently renamed then, Jupyter) platform. They expressed that the University of California system had invested quite heavily into the platform as an accessible, flexible, powerful, and convenient platform to teach analysis alongside the modern, Python programming language. Motivated to demo out this new platform for analysis, Dr. Kuh wanted the team to explore this platform and, with it, begin to learn more about machine learning, as it was a hot topic along with "big data" at the time. | ||
Line 24: | Line 25: | ||
===== Goals ===== | ===== Goals ===== | ||
- | *Spring 2020 Edit* | + | *Fall 2020 Edit* |
- | We will be focusing on reviewing various analytical models and techniques in order to give us a solid grounding in forecasting. To do this, we will make use of (hopefully) Jupyter (an interactive Python environment and platform) or the statistical programming language, R. Both have extensive support for statistics and machine learning. We will try to primarily focus on Python. | + | We will be focusing on reviewing various analytical models and techniques in order to give us a solid grounding in machine learning. To do this, we will make use of a popular programming language used extensively in data science; Python. For our development environment of choice, we are going with Google Colaboratory due to it's convenient integration with Google Drive as well as free access to Google's compute hardware. |
- | * Learn Jupyter (formerly iPython) and possibly some R. | + | Although previous SCEL teams have already made progress in machine learning, the documentation is somewhat scarce. For this reason, we will largely be starting over from the beginning in Fall 2020. |
- | * Learn and review various statistical techniques and models as applied in Python/R. | + | |
- | * Program a dynamically learning algorithm that will automatically predict, update linear coefficients and trim/add to the feature spaces needed for forecasting to all the weather boxes. | + | |
- | ----- | + | Our objectives for this semester: |
+ | * Learn Python and Google Colab. | ||
+ | * Learn and review various statistical techniques and models as applied in Python. | ||
+ | * Start on a new and well documented machine learning framework for SCEL. | ||
+ | ===== Resources ====== | ||
- | {{:forecasting:blockdiagramsp16.jpg?700|}} | + | *Fall 2020 Update* |
- | ===== Resources ====== | + | This semester, we have decided to register for an online machine learning course on Udemy. The course we have chosen is **Machine Learning A-Z™: Hands-On Python & R In Data Science** by Kirill Eremenko. At the time of writing, it is the highest rated course on Udemy for Machine Learning. The price for the course is listed at $100+, but there are frequent sales on Udemy where the courses may be purchased at only 10% of the original cost. Just be patient. |
+ | |||
+ | The class is mostly designed for computer science students and therefore you should not expect to learn much about the mathematical proofs behind machine learning algorithms. | ||
+ | |||
+ | Previous SCEL machine learning teams have used Jupyter as their development environment of choice. The Udemy course mentioned above uses Google Colab, and so we switch over to that as well. | ||
+ | |||
+ | ==== Udemy ==== | ||
+ | * [[https://www.udemy.com/course/machinelearning/ | Udemy]] - Online learning platform | ||
+ | * [[https://www.udemy.com/share/101WciAEMbdlpWTXQJ/ | Machine Learning A-Z™: Hands-On Python & R In Data Science]] - Online course taken | ||
- | ==== Jupyter ==== | + | ==== Jupyter (No longer using)==== |
* [[http://jupyter.readthedocs.org/en/latest/install.html | Install iPython]] - Installation tutorial for iPython. Anaconda recommended for users new to Python. | * [[http://jupyter.readthedocs.org/en/latest/install.html | Install iPython]] - Installation tutorial for iPython. Anaconda recommended for users new to Python. | ||
* [[forecasting: iPython Tutorials]] | * [[forecasting: iPython Tutorials]] | ||
* [[.:how to download data]] | * [[.:how to download data]] | ||
- | ==== Connecting MATLAB to a database ==== | + | ==== Connecting MATLAB to a database (No longer using)==== |
{{:forecasting:connecting_matlab_to_pos_database_kdelchev.pdf|}}\\ | {{:forecasting:connecting_matlab_to_pos_database_kdelchev.pdf|}}\\ | ||
Line 50: | Line 61: | ||
===== Meeting Minutes ===== | ===== Meeting Minutes ===== | ||
+ | [[forecasting: Fall 2020 Meeting Minutes]]\\ | ||
[[forecasting: Spring 2017 Meeting Minutes]]\\ | [[forecasting: Spring 2017 Meeting Minutes]]\\ | ||
[[forecasting: Fall 2016 Meeting Minutes]]\\ | [[forecasting: Fall 2016 Meeting Minutes]]\\ | ||
Line 59: | Line 71: | ||
- | ===== Documentation and Notes ==== | + | ===== Documentation and Notes ===== |
+ | |||
+ | ==== Current Notes ==== | ||
+ | [[Data Preprocessing]]\\ | ||
+ | [[Regression]]\\ | ||
+ | [[Classification]]\\ | ||
+ | [[Clustering]]\\ | ||
+ | |||
+ | |||
+ | ==== Archived Notes ==== | ||
[[https://docs.google.com/document/d/1keX9LUpu0hDK2uMDyuJe8mySTMc1qQ0sMVJLVDT1odA/edit|General iPython Notes]] | [[https://docs.google.com/document/d/1keX9LUpu0hDK2uMDyuJe8mySTMc1qQ0sMVJLVDT1odA/edit|General iPython Notes]] | ||
\\ | \\ |