Generative AI: 1. Ethics 2.CLIP: Difference between revisions
Jump to navigation
Jump to search
Line 22: | Line 22: | ||
| | | | ||
Familiarizing with Dromedary, SALMON, Llama base models. | Familiarizing with Dromedary, SALMON, Llama base models. | ||
√ | |√ | ||
|- | |- | ||
Line 30: | Line 30: | ||
Evaluation of different base models. <br> | Evaluation of different base models. <br> | ||
Choice of using Llama 2 model as our baseline. | Choice of using Llama 2 model as our baseline. | ||
|√ | |||
|- | |- | ||
Line 37: | Line 38: | ||
Red teaming dataset exploration. <br> | Red teaming dataset exploration. <br> | ||
Reading about ethical theories. <br> | Reading about ethical theories. <br> | ||
|√ | |||
|- | |- | ||
Line 43: | Line 45: | ||
| | | | ||
[https://github.com/hendrycks/ethics ETHICS dataset] discovering. | [https://github.com/hendrycks/ethics ETHICS dataset] discovering. | ||
|√ | |||
|- | |- | ||
Line 50: | Line 53: | ||
ETHICS dataset formatting for Llama fine-tuning and evaluation.<br> | ETHICS dataset formatting for Llama fine-tuning and evaluation.<br> | ||
Llama supervised model fine-tuning | Llama supervised model fine-tuning | ||
|√ | |||
|- | |- | ||
Line 57: | Line 61: | ||
Evaluation of Llama model before and after fine-tuning with ETHICS dataset.<br> | Evaluation of Llama model before and after fine-tuning with ETHICS dataset.<br> | ||
Mid-term Presentation & Start writing the Wikipedia page with the plan. | Mid-term Presentation & Start writing the Wikipedia page with the plan. | ||
|√ | |||
|- | |- | ||
Line 65: | Line 70: | ||
Re-formatting deontology dataset. <br> | Re-formatting deontology dataset. <br> | ||
Creation of the preference model. | Creation of the preference model. | ||
|√ | |||
|- | |- | ||
!scope="row"|Week 12 | !scope="row"|Week 12| | ||
| | | | ||
| | | | ||
Line 75: | Line 81: | ||
!scope="row"|Week 13 | !scope="row"|Week 13 | ||
| | |||
| | | | ||
| | | | ||
Line 83: | Line 90: | ||
| | | | ||
| Write the Wikipedia page & Final presentation | | Write the Wikipedia page & Final presentation | ||
| | |||
|- | |- | ||
|} | |} |
Revision as of 21:42, 4 December 2023
Project Plan and Milestones
Weekly Plan
Date | Task | Completion | |
---|---|---|---|
Week 4 |
Paper reading. |
√ | |
Week 5 |
Familiarizing with Dromedary, SALMON, Llama base models. |
√ | |
Week 6 |
Evaluation of different base models. |
√ | |
Week 7 |
Red teaming dataset exploration. |
√ | |
Week 8 |
ETHICS dataset discovering. |
√ | |
Week 9 |
ETHICS dataset formatting for Llama fine-tuning and evaluation. |
√ | |
Week 10 |
Evaluation of Llama model before and after fine-tuning with ETHICS dataset. |
√ | |
Week 11 |
Read about Reinforcement learning using PPO. |
√ | |
Week 12| | |||
Week 13 | |||
Week 14 | Write the Wikipedia page & Final presentation |
Milestone 1
- Choose the project subject.
- Read papers about the existing studies in this field.
- Define our research questions.
Milestone 2
- Refine our research questions.
- Explore different ethical theories.
- Find an appropriate dataset.
- Evaluate our fine-tuned supervised model.
Milestone 3
- Get our Preference and the Reinforcement learning models.
- Analyze the results.
- Write the Wikipedia page.