SlideShare a Scribd company logo
1 of 22
Download to read offline
추천아 놀자 4회
영화 분류하기
곧 시작함
영화 분류하기
영화의 19가지의 장르 정보로 유사한 것끼리 분류
- 데이터 셋 : movielens의 영화 장르 정보
- 분류 알고리즘 : k-means
- 영화 장르간의 유사도는 : cosine similarity
영화 분류하기 – 데이터 셋
영화 분류하기 – 데이터 셋
영화 분류하기 – 데이터 셋
영화 분류하기 – 데이터 셋
movie id | movie title | release date | video release date | IMDb URL |
unknown | Action | Adventure | Animation ... 19개 장르 등
영화 분류하기 – 데이터 추출( 장르 정보만 )
movie title + Action | Adventure | Animation ... 19개 장르 등
1|Toy Story (1995)|0|0|0|1|1|1|0|0|0|0|0|0|0|0|0|0|0|0|0
2|GoldenEye (1995)|0|1|1|0|0|0|0|0|0|0|0|0|0|0|0|0|1|0|0
3|Four Rooms (1995)|0|0|0|0|0|0|0|0|0|0|0|0|0|0|0|0|1|0|0
4|Get Shorty (1995)|0|1|0|0|0|1|0|0|1|0|0|0|0|0|0|0|0|0|0
영화 분류하기 – 영화간의 유사도
Toy Story (1995)
|0|0|0|1|1|1|0|0|0|0|0|1|0|0|0|0|1|0|0
|0|1|1|0|0|0|0|0|0|0|0|1|0|0|0|0|1|0|0
GoldenEye
주어진데이터를K개의군집으로나누는알고리즘이다.
①나눌군집개수K를결정
②임의의군집중심으로가까운점들끼리묶음
③각각의군집에대하여평균을새로구함
④새로운평균의중심값으로가장근접한점들끼리묶음
⑤3번,4번단계를반복적으로수행하여변경이없을때까지수행
① ② ③ ④
⑤
영화 분류하기 – K-Means 클러스터링
영화 분류하기 – 클러스터링
K-Means 과정
- 데이터 셋 만들다(Vector)
[0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)]
[0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)]
[0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Copycat (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Shanghai Triad (Yao a yao yao
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Dead Man Walking (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, Richard III (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Seven (Se7en) (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, Postino, Il (1994)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
.
.
.
[0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, From Dusk Till Dawn (1996)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, White Balloon, The (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Antonia's Line (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, Angels and Insects (1995)]
[0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Muppet Treasure Island (1996)
[0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, Braveheart (1995)]
영화 분류하기 – 클러스터링
K-Means 과정
- 클러스터링 개수 설정
3개
영화 분류하기 – 클러스터링
K-Means 과정
- 초기 Centro-id 결정 : 무작위 결정
[0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)]
[0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)]
[0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Copycat (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Dead Man Walking (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, Richard III (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Seven (Se7en) (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, Postino, Il (1994)
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
.
.
.
[0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, From Dusk Till Dawn (1996)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, White Balloon, The (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Antonia's Line (1995)]
[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, Angels
[0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Muppet Treasure Island (1996)
[0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, Braveheart (1995)]
1번클러스터Centro-id
2번클러스터Centro-id
3번클러스터Centro-id
영화 분류하기 – 클러스터링
K-Means 과정
- Centro-id1,2,3과데이터 셋의 유사도 측정
0.0, 0.0, 0.0, 0.0, 1.0, 0.0,
Toy Story (1995)]
1번클러스터Centro-id1
2번클러스터Centro-id2
3번클러스터Centro-id3
유사도계산0.95
0.85
0.98
영화 분류하기 – 클러스터링
K-Means 과정
- 가까운 Centro-id의 클러스터링 묶음
0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Copycat (1995)]
0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Dead Man Walking (1995)]
0.0, 0.0, 0.0, 1.0, 0.0, Richard III (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Seven (Se7en) (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)
0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
1번클러스터Centro-id1
0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
2번클러스터Centro-id2
0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)]
3번클러스터Centro-id3
0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
영화 분류하기 – 클러스터링
K-Means 과정
- 클러스터링된 데이터셋의 중심값 구하기
1번클러스터Centro-id1
0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
2번클러스터Centro-id2
0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)]
3번클러스터Centro-id3
0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
0.0, 0.3, 1.0, 0.0, 0.1,
0.0, 1.0, 1.0, 0.8, 0.0,
0.9, 0.0, 1.0, 0.0, 0.3,
영화 분류하기 – 클러스터링
K-Means 과정
- 새로운 중심값을 Centroid로 구성
1번클러스터newCentro-id1
2번클러스터newCentro-id2
3번클러스터newCentro-id3
0.0, 0.3, 1.0, 0.0, 0.1,
0.0, 1.0, 1.0, 0.8, 0.0,
0.9, 0.0, 1.0, 0.0, 0.3,
영화 분류하기 – 클러스터링
K-Means 과정
- new Centro-id로 다시 클러스터링 실행
0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Copycat (1995)]
0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Dead Man Walking (1995)]
0.0, 0.0, 0.0, 1.0, 0.0, Richard III (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Seven (Se7en) (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)
0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
1번클러스터Centro-id1
0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
2번클러스터Centro-id2
0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)]
3번클러스터Centro-id3
0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
영화 분류하기 – 클러스터링
K-Means 과정
- 클러스터링된 데이터셋의 다시 중심값 구하기
1번클러스터Centro-id1
0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)]
2번클러스터Centro-id2
0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)]
3번클러스터Centro-id3
0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)]
0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)]
0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
0.0, 0.3, 1.0, 0.0, 0.1,
0.0, 1.0, 1.0, 0.8, 0.0,
0.9, 0.0, 1.0, 0.0, 0.3,
영화 분류하기 – 클러스터링
K-Means 과정
- 이전의 중심값과 새로운 중심값을 비교
- 클러스터링 반복
이전의중심값Centro-id1
0.0, 0.3, 1.0, 0.0, 0.1,
새로운중심값Centro-id1
0.0, 0.3, 1.0, 0.0, 0.1,
영화 분류하기 – 클러스터링
K-Means 과정
- 이전의 중심값과 새로운 중심값을 비교
- 클러스터 종료
이전의중심값Centro-id1
0.0, 0.3, 1.0, 0.0, 0.1,
새로운중심값Centro-id1
0.0, 0.3, 1.0, 0.0, 0.1,
영화 분류하기 – 최종 결과
Lion King, The (1994)
Snow White and the Seven Dwarfs (1937)
| All Dogs Go to Heaven 2 (1996) |
Bedknobs and Broomsticks (1971) |
Sound of Music, The (1965)
Robert A. Heinlein's The Puppet Masters (1994)
Blade Runner (1982) | Aristocats, The (1970)
Flipper (1996) | Wallace & Gromit: The Best
of Aardman Animation (1996) | Kansas City (1996)
| Homeward Bound: The Incredible Journey (1993)
| 20,000 Leagues Under the Sea (1954) | Brazil (
GoldenEye (1995)
Rumble in the Bronx (1995)
Bad Boys (1995)
Strange Days (1995)
Natural Born Killers (1994)
Stargate (1994)
Fugitive, The (1993)
Jurassic Park (1993) |
감사합니다.
방송국 : Afreecatv.com/goodvc
블로그 : goodvc78.postach.io

More Related Content

Viewers also liked

Recommendation system 소개 (1)
Recommendation system 소개 (1)Recommendation system 소개 (1)
Recommendation system 소개 (1)
주영 송
 

Viewers also liked (20)

딥러닝과 강화 학습으로 나보다 잘하는 쿠키런 AI 구현하기
딥러닝과 강화 학습으로 나보다 잘하는 쿠키런 AI 구현하기딥러닝과 강화 학습으로 나보다 잘하는 쿠키런 AI 구현하기
딥러닝과 강화 학습으로 나보다 잘하는 쿠키런 AI 구현하기
 
Collaborative filtering
Collaborative filteringCollaborative filtering
Collaborative filtering
 
Recommendation system 소개 (1)
Recommendation system 소개 (1)Recommendation system 소개 (1)
Recommendation system 소개 (1)
 
[1D4]오타 수정과 편집 기능을 가진 Android Keyboard Service 개발기
[1D4]오타 수정과 편집 기능을 가진 Android Keyboard Service 개발기[1D4]오타 수정과 편집 기능을 가진 Android Keyboard Service 개발기
[1D4]오타 수정과 편집 기능을 가진 Android Keyboard Service 개발기
 
Recommendation for dummy
Recommendation for dummyRecommendation for dummy
Recommendation for dummy
 
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
 
Hello, Recommender System
Hello, Recommender SystemHello, Recommender System
Hello, Recommender System
 
[2B4]Live Broadcasting 추천시스템
[2B4]Live Broadcasting 추천시스템  [2B4]Live Broadcasting 추천시스템
[2B4]Live Broadcasting 추천시스템
 
[1D2]아이비컨과 공유기 해킹을 통한 인도어 IOT 삽질기
[1D2]아이비컨과 공유기 해킹을 통한 인도어 IOT 삽질기[1D2]아이비컨과 공유기 해킹을 통한 인도어 IOT 삽질기
[1D2]아이비컨과 공유기 해킹을 통한 인도어 IOT 삽질기
 
[2C4]Clustered computing with CoreOS, fleet and etcd
[2C4]Clustered computing with CoreOS, fleet and etcd[2C4]Clustered computing with CoreOS, fleet and etcd
[2C4]Clustered computing with CoreOS, fleet and etcd
 
집단지성 프로그래밍 06-의사결정트리-01
집단지성 프로그래밍 06-의사결정트리-01집단지성 프로그래밍 06-의사결정트리-01
집단지성 프로그래밍 06-의사결정트리-01
 
[1B5]github first-principles
[1B5]github first-principles[1B5]github first-principles
[1B5]github first-principles
 
집단지성 프로그래밍 01-데이터마이닝 개요
집단지성 프로그래밍 01-데이터마이닝 개요집단지성 프로그래밍 01-데이터마이닝 개요
집단지성 프로그래밍 01-데이터마이닝 개요
 
[1A1]행복한프로그래머를위한철학
[1A1]행복한프로그래머를위한철학[1A1]행복한프로그래머를위한철학
[1A1]행복한프로그래머를위한철학
 
[2D4]Python에서의 동시성_병렬성
[2D4]Python에서의 동시성_병렬성[2D4]Python에서의 동시성_병렬성
[2D4]Python에서의 동시성_병렬성
 
[2A1]Line은 어떻게 글로벌 메신저 플랫폼이 되었는가
[2A1]Line은 어떻게 글로벌 메신저 플랫폼이 되었는가[2A1]Line은 어떻게 글로벌 메신저 플랫폼이 되었는가
[2A1]Line은 어떻게 글로벌 메신저 플랫폼이 되었는가
 
Deep learning 기반TmapPOI 추천기술개발사례
Deep learning 기반TmapPOI 추천기술개발사례Deep learning 기반TmapPOI 추천기술개발사례
Deep learning 기반TmapPOI 추천기술개발사례
 
20141223 머하웃(mahout) 협업필터링_추천시스템구현
20141223 머하웃(mahout) 협업필터링_추천시스템구현20141223 머하웃(mahout) 협업필터링_추천시스템구현
20141223 머하웃(mahout) 협업필터링_추천시스템구현
 
[224] backend 개발자의 neural machine translation 개발기 김상경
[224] backend 개발자의 neural machine translation 개발기 김상경[224] backend 개발자의 neural machine translation 개발기 김상경
[224] backend 개발자의 neural machine translation 개발기 김상경
 
소셜리포트(6월호) 인스타그램, 라이브리로 활용하는 방법
소셜리포트(6월호) 인스타그램, 라이브리로 활용하는 방법소셜리포트(6월호) 인스타그램, 라이브리로 활용하는 방법
소셜리포트(6월호) 인스타그램, 라이브리로 활용하는 방법
 

More from choi kyumin

More from choi kyumin (6)

개인화 추천은 어디로 가고 있는가?
개인화 추천은 어디로 가고 있는가?개인화 추천은 어디로 가고 있는가?
개인화 추천은 어디로 가고 있는가?
 
Deview2020 유저가 좋은 작품(웹툰)을 만났을때
Deview2020 유저가 좋은 작품(웹툰)을 만났을때 Deview2020 유저가 좋은 작품(웹툰)을 만났을때
Deview2020 유저가 좋은 작품(웹툰)을 만났을때
 
추천시스템 이제는 돈이 되어야 한다.
추천시스템 이제는 돈이 되어야 한다.추천시스템 이제는 돈이 되어야 한다.
추천시스템 이제는 돈이 되어야 한다.
 
Song Feature 조금더
Song Feature 조금더 Song Feature 조금더
Song Feature 조금더
 
눈으로 듣는 음악 추천 시스템-2018 if-kakao
눈으로 듣는 음악 추천 시스템-2018 if-kakao눈으로 듣는 음악 추천 시스템-2018 if-kakao
눈으로 듣는 음악 추천 시스템-2018 if-kakao
 
[데이터야놀자2107] 강남 출근길에 판교/정자역에 내릴 사람 예측하기
[데이터야놀자2107] 강남 출근길에 판교/정자역에 내릴 사람 예측하기 [데이터야놀자2107] 강남 출근길에 판교/정자역에 내릴 사람 예측하기
[데이터야놀자2107] 강남 출근길에 판교/정자역에 내릴 사람 예측하기
 

Recently uploaded

"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
mphochane1998
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
jaanualu31
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
chumtiyababu
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 

Recently uploaded (20)

"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planes
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to Computers
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
 

추놀 4회 영화 분류하기

  • 1. 추천아 놀자 4회 영화 분류하기 곧 시작함
  • 2. 영화 분류하기 영화의 19가지의 장르 정보로 유사한 것끼리 분류 - 데이터 셋 : movielens의 영화 장르 정보 - 분류 알고리즘 : k-means - 영화 장르간의 유사도는 : cosine similarity
  • 3. 영화 분류하기 – 데이터 셋
  • 4. 영화 분류하기 – 데이터 셋
  • 5. 영화 분류하기 – 데이터 셋
  • 6. 영화 분류하기 – 데이터 셋 movie id | movie title | release date | video release date | IMDb URL | unknown | Action | Adventure | Animation ... 19개 장르 등
  • 7. 영화 분류하기 – 데이터 추출( 장르 정보만 ) movie title + Action | Adventure | Animation ... 19개 장르 등 1|Toy Story (1995)|0|0|0|1|1|1|0|0|0|0|0|0|0|0|0|0|0|0|0 2|GoldenEye (1995)|0|1|1|0|0|0|0|0|0|0|0|0|0|0|0|0|1|0|0 3|Four Rooms (1995)|0|0|0|0|0|0|0|0|0|0|0|0|0|0|0|0|1|0|0 4|Get Shorty (1995)|0|1|0|0|0|1|0|0|1|0|0|0|0|0|0|0|0|0|0
  • 8. 영화 분류하기 – 영화간의 유사도 Toy Story (1995) |0|0|0|1|1|1|0|0|0|0|0|1|0|0|0|0|1|0|0 |0|1|1|0|0|0|0|0|0|0|0|1|0|0|0|0|1|0|0 GoldenEye
  • 10. 영화 분류하기 – 클러스터링 K-Means 과정 - 데이터 셋 만들다(Vector) [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)] [0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)] [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Copycat (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Shanghai Triad (Yao a yao yao [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Dead Man Walking (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, Richard III (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Seven (Se7en) (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, Postino, Il (1994)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] . . . [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, From Dusk Till Dawn (1996)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, White Balloon, The (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Antonia's Line (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, Angels and Insects (1995)] [0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Muppet Treasure Island (1996) [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, Braveheart (1995)]
  • 11. 영화 분류하기 – 클러스터링 K-Means 과정 - 클러스터링 개수 설정 3개
  • 12. 영화 분류하기 – 클러스터링 K-Means 과정 - 초기 Centro-id 결정 : 무작위 결정 [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)] [0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)] [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Copycat (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0 [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Dead Man Walking (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, Richard III (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Seven (Se7en) (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, Postino, Il (1994) [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] . . . [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, From Dusk Till Dawn (1996)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, White Balloon, The (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, Antonia's Line (1995)] [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, Angels [0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, Muppet Treasure Island (1996) [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, Braveheart (1995)] 1번클러스터Centro-id 2번클러스터Centro-id 3번클러스터Centro-id
  • 13. 영화 분류하기 – 클러스터링 K-Means 과정 - Centro-id1,2,3과데이터 셋의 유사도 측정 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, Toy Story (1995)] 1번클러스터Centro-id1 2번클러스터Centro-id2 3번클러스터Centro-id3 유사도계산0.95 0.85 0.98
  • 14. 영화 분류하기 – 클러스터링 K-Means 과정 - 가까운 Centro-id의 클러스터링 묶음 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Copycat (1995)] 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Dead Man Walking (1995)] 0.0, 0.0, 0.0, 1.0, 0.0, Richard III (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Seven (Se7en) (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995) 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] 1번클러스터Centro-id1 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] 2번클러스터Centro-id2 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)] 3번클러스터Centro-id3 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
  • 15. 영화 분류하기 – 클러스터링 K-Means 과정 - 클러스터링된 데이터셋의 중심값 구하기 1번클러스터Centro-id1 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] 2번클러스터Centro-id2 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)] 3번클러스터Centro-id3 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)] 0.0, 0.3, 1.0, 0.0, 0.1, 0.0, 1.0, 1.0, 0.8, 0.0, 0.9, 0.0, 1.0, 0.0, 0.3,
  • 16. 영화 분류하기 – 클러스터링 K-Means 과정 - 새로운 중심값을 Centroid로 구성 1번클러스터newCentro-id1 2번클러스터newCentro-id2 3번클러스터newCentro-id3 0.0, 0.3, 1.0, 0.0, 0.1, 0.0, 1.0, 1.0, 0.8, 0.0, 0.9, 0.0, 1.0, 0.0, 0.3,
  • 17. 영화 분류하기 – 클러스터링 K-Means 과정 - new Centro-id로 다시 클러스터링 실행 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Copycat (1995)] 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Dead Man Walking (1995)] 0.0, 0.0, 0.0, 1.0, 0.0, Richard III (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Seven (Se7en) (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995) 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] 1번클러스터Centro-id1 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] 2번클러스터Centro-id2 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)] 3번클러스터Centro-id3 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)]
  • 18. 영화 분류하기 – 클러스터링 K-Means 과정 - 클러스터링된 데이터셋의 다시 중심값 구하기 1번클러스터Centro-id1 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Usual Suspects, The (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mighty Aphrodite (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Mr. Holland's Opus (1995)] 2번클러스터Centro-id2 0.0, 1.0, 0.0, 0.0, 0.0, Twelve Monkeys (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Babe (1995)] 3번클러스터Centro-id3 0.0, 0.0, 0.0, 0.0, 0.0, Toy Story (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, GoldenEye (1995)] 0.0, 0.0, 1.0, 0.0, 0.0, Four Rooms (1995)] 0.0, 0.0, 0.0, 0.0, 0.0, Get Shorty (1995)] 0.0, 0.3, 1.0, 0.0, 0.1, 0.0, 1.0, 1.0, 0.8, 0.0, 0.9, 0.0, 1.0, 0.0, 0.3,
  • 19. 영화 분류하기 – 클러스터링 K-Means 과정 - 이전의 중심값과 새로운 중심값을 비교 - 클러스터링 반복 이전의중심값Centro-id1 0.0, 0.3, 1.0, 0.0, 0.1, 새로운중심값Centro-id1 0.0, 0.3, 1.0, 0.0, 0.1,
  • 20. 영화 분류하기 – 클러스터링 K-Means 과정 - 이전의 중심값과 새로운 중심값을 비교 - 클러스터 종료 이전의중심값Centro-id1 0.0, 0.3, 1.0, 0.0, 0.1, 새로운중심값Centro-id1 0.0, 0.3, 1.0, 0.0, 0.1,
  • 21. 영화 분류하기 – 최종 결과 Lion King, The (1994) Snow White and the Seven Dwarfs (1937) | All Dogs Go to Heaven 2 (1996) | Bedknobs and Broomsticks (1971) | Sound of Music, The (1965) Robert A. Heinlein's The Puppet Masters (1994) Blade Runner (1982) | Aristocats, The (1970) Flipper (1996) | Wallace & Gromit: The Best of Aardman Animation (1996) | Kansas City (1996) | Homeward Bound: The Incredible Journey (1993) | 20,000 Leagues Under the Sea (1954) | Brazil ( GoldenEye (1995) Rumble in the Bronx (1995) Bad Boys (1995) Strange Days (1995) Natural Born Killers (1994) Stargate (1994) Fugitive, The (1993) Jurassic Park (1993) |