• 검색 결과가 없습니다.

Large Scale Data Analysis Using Deep Learning

N/A
N/A
Protected

Academic year: 2022

Share "Large Scale Data Analysis Using Deep Learning"

Copied!
31
0
0

로드 중.... (전체 텍스트 보기)

전체 글

(1)

Large Scale Data Analysis Using Deep Learning

Course Introduction U Kang

Seoul National University

(2)

U Kang 2

In This Lecture

Motivation to study deep learning

Administrative information for this course

(3)

Outline

Deep Learning

Course Information

(4)

U Kang 4

Deep Learning as a Machine Learning

Machine Learning (ML)

Given x (predictor) and y (response), ML learns a function f() from data, such that y = f(x)

E.g., x = image, y = category

This learned function f() can be used to classify a new example x’

This is different from a typical programming where you want to compute y, given x and f()

Deep learning provides good performances in learning f() for many problems

Learns non-linear functions

(5)

Deep Learning as a Machine Learning

Data Size

Accuracy Deep Learning

Other machine learning

(6)

U Kang 6

Learning Tasks

Image classification

Speech recognition

Text classification

“Taxi”

Hello, dear

International politics

(7)

Main Idea

Most perception (input processing) in the brain may use one learning algorithm

Design learning methods that mimic the

brain

(8)

U Kang 8

Neurons In the Brain

(9)

Neural Network

[LeCun et al., Nature 2015]

(10)

U Kang 10

Convolutional Neural Net (CNN)

(11)

Representation Learning

Typical machine learning

Deep learning

Input Extract Output

Features

x y

Input Output

x y

Extract Features

Extract Features

Extract Features

Classifier

Classifier

(12)

U Kang 12

Human Level Object Recognition

ImageNet

~ 15M labeled images, ~ 22K categories

Top-5 Error rates (1.2 million images, 1k categories)

Non-CNN based method (~2012): 26.2 %

Alexnet (2012): 15.3 %

GoogLeNet (2014): 6.66 %

Resnet (2015): 3.57%

Human level:

5.1% error

(13)

Human-Level Face Recognition

DeepFace

97% accuracy

~ Human-level

[Taigman et al.

CVPR 2014]

(14)

U Kang 14

Computer Game

Deepmind

(15)

Computer Game

Deepmind

[Nature 2015]

https://www.youtube.com/watch?v=V1eYniJ0Rnk

(16)

U Kang 16

AlphaGo

[Silver et al., Mastering the game of Go with

deep neural networks and tree search, Nature 2016]

(17)

Neural Artist

[Gatys et al., Image Style Transfer Using Neural Networks, CVPR 2016]

(18)

U Kang 18

Machine Translation

(19)

Topics in This Course (tentative)

(ch. 1) Introduction

(ch. 2) Linear Algebra

(ch. 3) Probability and Information Theory

(ch. 4) Numerical Computation

(ch. 5) Machine Learning Basics

(ch. 6) Deep Feedforward Networks

(ch. 7) Regularization for Deep Learning

(ch. 8) Optimization for Training Deep Models

(ch. 9) Convolutional Networks

(ch. 10) Sequence Modeling: Recurrent and Recursive Nets

(ch. 11) Practical Methodology

(ch. 12) Applications

(20)

U Kang 20

Outline

Deep Learning

Course Information

(21)

M1522.001600, Spring 2017

http://datalab.snu.ac.kr/~ukang/courses/17S-DL/

Lecture slide: at least 1 hour before the lecture

TA

Beunguk Ahn (beunguk.ahn@gmail.com, 301-519)

Office hour

(Prof) Mon. 11:00 – 12:00

(TA) Tue. 14:00 – 15:00

Class meets: Mon, Wed 14:00 – 15:15, 301-101

(22)

U Kang 22

Textbook

Deep Learning (Ian Goodfellow, Yoshua Bengio, and Aaron Courville)

Available at http://www.deeplearningbook.org

(23)

Prerequisites

Basic probability

Average, std. deviation, typical distributions, MLE, …

Basic linear algebra

Rank, singular value decomposition

Programming language

Python, (C++, Java)..

Machine Learning or Artificial Intelligence

Basic understandings of machine learning

(24)

U Kang 24

Grading

10% Attendance and Quiz (random)

30% Project

30% Midterm

30% Final

(+5% Participation)

(25)

Late Policy

For all deliverables (homework, code, …)

No delay penalties, for medical etc. emergencies (bring doctor's note)

Each person has 4 'slip days' total, for the whole

semester. 10% per day of delay, after that

(26)

U Kang 26

Project

A good opportunity to solve real world problems using deep learning

Team

A group of 3-4 persons

If you cannot find your team mate, discuss with

the TA and/or instructor

(27)

Project

Topic

Improve the current status-of-the-art in deep learning

Deep learning applications

E.g., Novel applications

Deep learning implementations

E.g., Fast implementations using GPU

Method of deep learning

E.g., New regularization method

E.g., Achieve the best score in object recognition competition

We will provide some candidates, but feel free to propose your own topic

Feel free to discuss with the TA and/or the instructor

before the proposal

(28)

U Kang 28

Project

Advice

Start your project from day 1: today!

Think of the data you are interested in (e.g. image, audio, text, graph, etc.), and how to get it

Read related papers

Find your team mates

Data should be ready very soon

In the worst case, it should be ready until the end of March

If you plan to collect the data later, you might not get it until the semester ends

(29)

Project

Schedule and grading

Project proposal (due April 3) : 10%

Progress report (due May 3): 20%

Final report and presentation (due June 7) : 70%

(30)

U Kang 30

Advice

Read each chapter before class

It is ok to encounter something you don’t understand. Just mark it, and later you will understand it when you come back.

“Understand” intuitions of main ideas

Do not memorize without understanding

Improve your problem solving skills

Active participation encouraged

All questions are right; ask many questions

Use office hours (instructor and/or TA)

Enjoy this course, and study hard!

(31)

Questions?

참조

관련 문서

Preliminary Study of IR Sensor Equipped Drone-based Nuclear Power Plant Diagnosis Method using Deep Learning. Ik Jae Jin and

Na, Prediction of Hydrogen Concentration in Nuclear Power Plant Containment under Severe Accidents using Cascaded Fuzzy Neural Networks, Nuclear Engineering

100 lux 조건 하에 동영상 촬영을 진행하 고, 모든 동영상 촬영이 끝난 후 Noldus의 Ethovision(ver. 8.0)으 로 mouse가 open arms에 머문 시간과 mouse가 open

Motivation – Learning noisy labeled data with deep neural network.

A Method for Collision Avoidance of VLCC Tankers using Velocity Obstacles and Deep Reinforcement Learning.. 지도교수

A Study on the Power Metering Data Missing Correction Model Based on Deep Learning..

(Note this is idealized example. In reality graph is not bipartite and each page has both the hub and authority score). Each page starts with

An RNN model is neural network architecture using a single layer or multiple layers, consisting of circulation connections, commonly applied for learning the