
Posted: 2013-06-03 14:26

DGMIF Dr. Jeong-Woo Sohn (손정우) - Motor sequences, Reinforcement Learning and the Brain

Author: Administrator

4th Institute for Cognitive Science Seminar Announcement

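The Institute for Cognitive Science at Seoul National University (Director: Prof. Byoung-Tak Zhang) will host an invited seminar by Dr. Jeong-Woo Sohn, Principal Researcher at DGMIF, on Friday, June 7 at 1:00 PM, as detailed below. Everyone interested is warmly invited to attend.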

Title: Motor sequences, Reinforcement Learning and the Brain

Speaker: Jeong-Woo Sohn (Principal Researcher, Daegu-Gyeongbuk Medical Innovation Foundation)

- Date and time: Friday, June 7, 2013, 1:00 PM (refreshments from 12:20 PM)

- Venue: International Conference Room, 3rd floor, Shinyang Humanities Hall (인문신양학술관, Building 4)

- Host: Institute for Cognitive Science, Seoul National University (02-880-7734, icogsci@snu.ac.kr)

Abstract:

People often execute quite complicated motor sequences effortlessly once they are learned, as in driving a car. Several machine learning theories have been applied to the process of motor sequence learning. One of the most promising is reinforcement learning (RL), in which an agent chooses optimal sequences of actions or states by maximizing cumulative reward. Neural components for RL have recently been reported, suggesting that humans and animals likely adopt RL-like learning. In this talk, I will present two experimental results showing neural substrates that bridge motor sequences and reinforcement learning. In the first experiment, I recorded neuronal activity from the SMA (supplementary motor area) and pre-SMA of two rhesus monkeys while the monkeys executed motor sequences by moving joysticks. In this task, the number of movements required for the monkey to obtain a reward varied from trial to trial. Therefore, value in the RL context (how close the animal is to reward) was dissociated from ordinal position in a sequence for movements in the same direction. The results showed that neurons in both areas modulated their activity more strongly with temporal proximity to reward (i.e., value) than with ordinal position. In the second experiment, I inactivated the skeletomotor area of the internal globus pallidus (GPi; an output area of the basal ganglia) by injecting muscimol while monkeys performed overlearned, newly learned, and random motor sequence tasks. It has been reported that dopaminergic neurons in the basal ganglia signal reward prediction error. This has led many researchers to believe that the basal ganglia are a neural substrate for RL by way of temporal difference learning. Inactivation of the GPi resulted in slower reaction times in the newly learned condition but almost no change in the overlearned and random conditions. This result shows that the basal ganglia play an important role in motor sequence learning, possibly in an RL fashion. In conclusion, the pre-SMA and SMA may be among the brain areas that estimate or store action values of motor sequences, and the basal ganglia process this information for motor sequence learning in an RL-like manner.
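As an illustration only (not the speaker's model or analysis), the dissociation between value and ordinal position described in the abstract can be sketched with tabular temporal difference learning, TD(0). The sketch assumes a toy task in which a trial requires 2, 3, or 4 movements and a reward of 1 follows the last movement; the function name `td0_values`, the state encoding `(sequence length, ordinal position)`, and the parameters `gamma` and `alpha` are all hypothetical choices made for this example.

```python
import random


def td0_values(lengths=(2, 3, 4), gamma=0.9, alpha=0.5, episodes=2000, seed=0):
    """Learn state values with TD(0) on a toy variable-length sequence task.

    A state is (n, k): the trial needs n movements and the agent is about
    to make movement number k. Reward 1.0 is delivered after movement n.
    """
    rng = random.Random(seed)
    V = {}  # value table, missing entries are treated as 0.0
    for _ in range(episodes):
        n = rng.choice(lengths)  # number of movements varies from trial to trial
        for k in range(1, n + 1):
            s = (n, k)
            if k < n:
                r, v_next = 0.0, V.get((n, k + 1), 0.0)
            else:
                r, v_next = 1.0, 0.0  # last movement: reward, then terminal
            # TD error: the reward prediction error signal
            delta = r + gamma * v_next - V.get(s, 0.0)
            V[s] = V.get(s, 0.0) + alpha * delta
    return V


if __name__ == "__main__":
    V = td0_values()
    for (n, k), v in sorted(V.items()):
        print(f"length={n} position={k} movements_left={n - k + 1} value={v:.3f}")
```

In this toy setting the learned value depends only on how many movements remain before reward, so the second movement of a two-movement trial and the second movement of a four-movement trial share an ordinal position but end up with different values.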

Dr. Jeong-Woo Sohn:

Ph.D. Brain and Cognitive Sciences, University of Rochester

M.S.E. Cognitive Sciences, Seoul National University

B.S.E. Nuclear Engineering, Seoul National University

2012-: Principal Researcher, Daegu-Gyeongbuk Medical Innovation Foundation

2011-2012: Research Associate, Dept. of Neurobiology, University of Pittsburgh

2007-2011: Postdoctoral Associate, Dept. of Neurobiology, University of Pittsburgh

2000: Team Manager of Technology Department, Orandif
