Tracking a Historic Market Crash through Articles

From FDHwiki
Revision as of 12:24, 2 December 2023 by Xingyu.pan (talk | contribs) (Created page with "== Project Plan and Milestones == ===Weekly Project Plan=== {| class="wikitable" width="80%" ! Week !! Tasks !! Completion |- | Week 4 | * Paper reading. * News dataset and financial labels exploring. | ✓ |- | Week 5 | * Learning NLP data preprocessing. * Finishing data preprocessing on New York Times dataset. | ✓ |- | Week 6 | * Initially defining the entire project's workflow and extracting key metrics. * Finishing data preprocessing on New York Times dataset....")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Project Plan and Milestones

Weekly Project Plan

Week Tasks Completion
Week 4
  • Paper reading.
  • News dataset and financial labels exploring.
Week 5
  • Learning NLP data preprocessing.
  • Finishing data preprocessing on New York Times dataset.
Week 6
  • Initially defining the entire project's workflow and extracting key metrics.
  • Finishing data preprocessing on New York Times dataset.
Week 7
  • Completing the feature engineering construction and basic pipeline for the TF-IDF based model.
  • Completing the feature engineering construction and basic pipeline for the sentiment dictionary-based model.
Weeks 8–9
  • Modifying model details based on the sequential characteristics of time-based prediction.
  • Improving various models, experimenting with different economic indicators to enhance accuracy.
Week 10
  • Reading papers to understand how to integrate semantic connections into the sentiment analysis model.
  • Prepare for the midterm presentation.
  • Starting to write the Wikipedia page.
Week 11
  • Reconstructing the dataset based on the suggestions from the mid-term report.
  • Utilizing four different pre-trained models for text sentiment detection: DistilBERT-base-uncased, FinBERT, Twitter-roBERTa-base, and FinBERT-tone.
Week 12
  • Applying and tweaking other time series models to enhance model accuracy.
  • Compare the results and analyze the correlations between sentiment scores and different financial indicators.
Week 13
  • Decision fusion for enhancing model accuracy.
  • Achieving visual representation to display news trends, sentiment analysis outcomes, and predictive metrics in real-time.
  • Depending on the situation: considering implementing incremental learning.
Week 14
  • Write the report.
  • Prepare for the final presentation.

Milestone 1

  • Drafting a project proposal about the aim and objectives of the project.
  • Searching for datasets with appropriate time granularity and corresponding economic labels.

Milestone 2

  • Grasp the NLP processing workflow.
  • Complete the construction of TF-IDF and emotional indicators in news.
  • Adjust the model to improve its accuracy.

Milestone 3

  • Utilize pre-trained models for sentiment analysis.
  • Decision fusion.
  • Prepare final presentation and Wikipedia page.