This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Reinforcement Learning using Layered Morphology (RLLM)
LW
Login
Reinforcement Learning using Layered Morphology (RLLM)
6
Intergenerational Knowledge Transfer (IKT)
MiguelDev
2mo
0
5
RLLMv10 experiment
MiguelDev
2mo
0
20
A T-o-M test: 'popcorn' or 'chocolate'
MiguelDev
3mo
13
7
Can RLLMv3's ability to defend against jailbreaks be attributed to datasets containing stories about Jung's shadow integration theory?
MiguelDev
3mo
2
4
Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B)
MiguelDev
4mo
0
16
GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks
MiguelDev
4mo
4
6
Research Log, RLLMv2: Phi-1.5, GPT2XL and Falcon-RW-1B as paperclip maximizers
MiguelDev
4mo
0
7
Reinforcement Learning using Layered Morphology (RLLM)
MiguelDev
6mo
0
5
An examination of GPT-2's boring yet effective glitch
MiguelDev
2mo
3