Scott Garrabrant
University of California, Los Angeles
Verified email at garrabrant.com
Title
Cited by
Year
Categorizing variants of Goodhart's Law
D Manheim, S Garrabrant
arXiv preprint arXiv:1803.04585, 2018
Cited by: 104*, Year: 2018
Risks from learned optimization in advanced machine learning systems
E Hubinger, C van Merwijk, V Mikulik, J Skalse, S Garrabrant
arXiv preprint arXiv:1906.01820, 2019
Cited by: 97, Year: 2019
Logical induction
S Garrabrant, T Benson-Tilsen, A Critch, N Soares, J Taylor
arXiv preprint arXiv:1609.03543, 2016
Cited by: 53*, Year: 2016
Embedded agency
A Demski, S Garrabrant
arXiv preprint arXiv:1902.09469, 2019
Cited by: 34, Year: 2019
Pattern avoidance is not P-recursive
S Garrabrant, I Pak
arXiv preprint arXiv:1505.06508, 2015
Cited by: 30*, Year: 2015
Using TPA to count linear extensions
J Banks, SM Garrabrant, ML Huber, A Perizzolo
Journal of Discrete Algorithms 51, 1-11, 2018
Cited by: 19*, Year: 2018
Risks from learned optimization in advanced machine learning systems. arXiv
E Hubinger, C van Merwijk, V Mikulik, J Skalse, S Garrabrant
arXiv preprint arXiv:1906.01820, 2019
Cited by: 13, Year: 2019
Words in Linear Groups, Random Walks, Automata and P-Recursiveness
S Garrabrant, I Pak
arXiv preprint arXiv:1502.06565, 2015
Cited by: 13, Year: 2015
Counting with irrational tiles
S Garrabrant, I Pak
arXiv preprint arXiv:1407.8222, 2014
Cited by: 12, Year: 2014
Asymptotic convergence in online learning with unbounded delays
S Garrabrant, N Soares, J Taylor
arXiv preprint arXiv:1604.05280, 2016
Cited by: 11, Year: 2016
Embedded agency
S Garrabrant, A Demski
AI Alignment Forum, 2018
Cited by: 7, Year: 2018
Asymptotic logical uncertainty and the Benford test
S Garrabrant, T Benson-Tilsen, S Bhaskar, A Demski, J Garrabrant, ...
Artificial General Intelligence: 9th International Conference, AGI 2016, New …, 2016
Cited by: 7, Year: 2016
Upper bounds in the Ohtsuki–Riley–Sakuma partial order on 2-bridge knots
SM Garrabrant, J Hoste, PD Shanahan
Journal of Knot Theory and Its Ramifications 21 (09), 1250084, 2012
Cited by: 7, Year: 2012
Goodhart taxonomy
S Garrabrant
Alignment Forum. URL: https://www.alignmentforum.org/posts …, 2017
Cited by: 5, Year: 2017
Two major obstacles for logical inductor decision theory
S Garrabrant
Intelligent Agents Foundation Forum, 2017
Cited by: 5, Year: 2017
Inductive Coherence
S Garrabrant, B Fallenstein, A Demski, N Soares
arXiv preprint arXiv:1604.05288, 2016
Cited by: 4*, Year: 2016
Cofinite Induced Subgraphs of Impartial Combinatorial Games: An Analysis of CIS-Nim
SM Garrabrant, EJ Friedman, AS Landsberg
INTEGERS 13, 2, 2013
Cited by: 3, Year: 2013
Geometric analysis of a generalized Wythoff game
E Friedman, SM Garrabrant, IK Phipps-Morgan, AS Landsberg, ...
Games of No Chance 5 5, 343, 2019
Cited by: 2, Year: 2019
P-recursive integer sequences and automata theory
SM Garrabrant
University of California, Los Angeles, 2015
Cited by: 1, Year: 2015
Temporal Inference with Finite Factored Sets
S Garrabrant
arXiv preprint arXiv:2109.11513, 2021
Year: 2021