LESSWRONGReinforcement Learning using Layered Morphology (RLLM)
LW

Reinforcement Learning using Layered Morphology (RLLM)

Dec 03, 2023 by MiguelDev

6Intergenerational Knowledge Transfer (IKT)

2mo

0

5RLLMv10 experiment

2mo

0

20A T-o-M test: 'popcorn' or 'chocolate'

3mo

13

7Can RLLMv3's ability to defend against jailbreaks be attributed to datasets containing stories about Jung's shadow integration theory?

3mo

2

4Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B)

4mo

0

16GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks

4mo

4

6Research Log, RLLMv2: Phi-1.5, GPT2XL and Falcon-RW-1B as paperclip maximizers

4mo

0

7Reinforcement Learning using Layered Morphology (RLLM)

6mo

0

5An examination of GPT-2's boring yet effective glitch

2mo

3