Music Information Retrieval
Yi-Hsuan Yang, Ph.D.
Music & Audio Computing Lab, Research Center for IT Innovation, Academia Sinica
http://www.citi.sinica.edu.tw/pages/yang/
yang@citi.sinica.edu.tw
Prelude
• PI @ Music & Audio Computing Lab, Academia Sinica, since 2011
• 10420CS 573100 “Music Information Retrieval” @ NTHU, 2016
https://twtmir.wordpress.com/
https://teachingmir.wikispaces.com/courses
Outline
• Types of music related research
• Fundamentals of music signal processing
• New opportunities in the big data era
Types of Music Related Research
1. Music creation
https://www.youtube.com/watch?v=3OEmzI52stk
https://www.youtube.com/watch?v=k1DgNfz1g_s
https://www.youtube.com/watch?v=wj1r9YJ6INA
http://www.inside.com.tw/2016/05/04/positive-grid-bias-head
https://youtu.be/rL5YKZ9ecpg?t=50m
Types of Music Related Research
2. Music information “analysis”
• automatic page turner
• automatic Karaoke scoring
• interactive concert
• chord recognizer
• music browsing assistant
Types of Music Related Research
3. Music information “retrieval”
• Search
‒ through keywords/labels (genre, instrument, emotion)
‒ through audio examples (humming, audio recording)
• Match
‒ to match music to 1) a video clip, 2) a photo slideshow, 3) song lyrics, or 4) a given context
‒ cross-domain retrieval
• Discover
‒ recommendation: diversity, serendipity, explanations
Types of Music Related Research
1. Music creation
• Google Magenta, Smule AutoRap, Samsung Hum-On, Positive Grid, Yamaha Vocaloid
2. Music information analysis
• Education, data visualization
3. Music information retrieval
• Search: through keywords (genre, instrument, emotion) or audio examples (humming or audio recording)
• Match: cross-domain retrieval
• Discover: recommendation
Outline
• Types of music related research
• Fundamentals of music signal processing
• New opportunities in the big data era
Fundamentals of Music Signal Processing
• Pitch: which notes are played?
• Tempo: how fast?
• Timbre: which instrument(s)?
Mozart’s Variationen (1st phrase)
Fundamentals of Music Signal Processing
          Karaoke scorer    chord recognizer    page turner
Pitch     ♪♪♪               ♪♪♪                 ♪♪♪
Tempo     ♪                 ♪                   ♪
Timbre    ♪                 ♪♪                  ♪
Fundamentals of Music Signal Processing
instrument classifier, content ID, Spotify running
Pitch ♪♪♪ ♪
Tempo ♪♪♪
Timbre ♪♪♪ ♪
Fundamentals of Music Signal Processing
          similarity search or recommendation    music emotion or genre recognizer    automatic music video generation
Pitch     ♪♪♪                                    ♪♪♪                                  ♪♪♪
Tempo     ♪♪♪                                    ♪♪♪                                  ♪♪♪
Timbre    ♪♪♪                                    ♪♪♪                                  ♪♪♪
Fundamentals of Music Signal Processing
• Listens to music
‒ tempo, instrumentation, key, time signature, energy, harmonic & timbral structures
• Reads about music
‒ lyrics, blog posts, reviews, playlists and discussion forums
• Learns about trends
‒ online music behavior: who's talking about which artists this week, what songs are being streamed or downloaded
• Not everything is in audio
Fundamentals of Music Signal Processing
• Let’s have a look at what we can extract from audio anyway
• Time-domain waveform
Fundamentals of Music Signal Processing
• Frequency-domain representation
• Spectrogram (obtained by Short-Time Fourier Transform)
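A minimal sketch of how a spectrogram can be obtained from a time-domain waveform with a Short-Time Fourier Transform. The test tone, the sampling rate, the window length `n_fft` and the hop size `hop` are illustrative assumptions, not values from the slides.

```python
import numpy as np

def stft_magnitude(x, n_fft=1024, hop=256):
    """Magnitude spectrogram: rows are frequency bins, columns are time frames."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop : i * hop + n_fft] * window for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1)).T   # shape: (n_fft // 2 + 1, n_frames)

sr = 8000                                   # sampling rate in Hz (assumed)
t = np.arange(sr) / sr                      # one second of time stamps
x = np.sin(2 * np.pi * 440.0 * t)           # time-domain waveform of a 440 Hz tone

S = stft_magnitude(x)
freqs = np.fft.rfftfreq(1024, d=1.0 / sr)   # centre frequency of each bin
peak_hz = freqs[S.mean(axis=1).argmax()]    # strongest bin, averaged over time
print(S.shape, peak_hz)
```

The strongest bin lands within one bin width (about 7.8 Hz here) of the true 440 Hz; a longer window gives finer frequency resolution at the cost of coarser time resolution.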
Fundamentals of Music Signal Processing
• Pitch
• Simple for monophonic signals (almost table lookup)
• Challenging for polyphonic signals; known as multi-pitch estimation (MPE)
‒ overlapping partials
‒ missing fundamentals
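The "almost table lookup" for monophonic signals can be sketched as follows: estimate one fundamental frequency, then map it to the nearest equal-tempered note. The peak-picking estimator and the A4 = 440 Hz reference are assumptions made for illustration; peak picking fails in exactly the cases the slide lists as hard (overlapping partials, missing fundamentals).

```python
import numpy as np

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

def hz_to_note(f0_hz):
    """Map a fundamental frequency to the nearest equal-tempered note (A4 = 440 Hz)."""
    midi = int(round(69 + 12 * np.log2(f0_hz / 440.0)))
    return f"{NOTE_NAMES[midi % 12]}{midi // 12 - 1}"

def estimate_f0(x, sr):
    """Pick the strongest spectral peak; adequate only for clean monophonic tones."""
    spectrum = np.abs(np.fft.rfft(x * np.hanning(len(x))))
    return np.fft.rfftfreq(len(x), d=1.0 / sr)[spectrum.argmax()]

sr = 8000                                   # sampling rate in Hz (assumed)
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 261.63 * t)       # synthetic middle C (C4)
print(hz_to_note(estimate_f0(tone, sr)))
```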
Fundamentals of Music Signal Processing
• Tempo: beats per minute (bpm)
• Onset detection, downbeat estimation, tempo estimation, beat tracking, rhythm pattern extraction
(figure panels: energy-based, spectrum-based)
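One possible energy-based pipeline, sketched on a synthetic click track: frame energy gives an onset (novelty) curve, and the autocorrelation of that curve gives a tempo estimate. The frame and hop sizes, the 60 to 200 bpm search range and the function names are assumptions for illustration only.

```python
import numpy as np

def onset_strength(x, frame=250, hop=125):
    """Energy-based novelty: positive change in frame energy."""
    n = 1 + (len(x) - frame) // hop
    energy = np.array([np.sum(x[i * hop : i * hop + frame] ** 2) for i in range(n)])
    return np.maximum(np.diff(energy, prepend=energy[0]), 0.0)

def estimate_tempo(x, sr, frame=250, hop=125, bpm_range=(60, 200)):
    """Pick the autocorrelation lag of the novelty curve with the highest score."""
    nov = onset_strength(x, frame=frame, hop=hop)
    nov = nov - nov.mean()
    ac = np.correlate(nov, nov, mode="full")[len(nov) - 1:]   # index = lag in frames
    frames_per_sec = sr / hop
    lo = int(np.ceil(frames_per_sec * 60 / bpm_range[1]))     # shortest lag searched
    hi = int(np.floor(frames_per_sec * 60 / bpm_range[0]))    # longest lag searched
    lag = lo + ac[lo : hi + 1].argmax()
    return 60.0 * frames_per_sec / lag

sr = 8000                  # sampling rate in Hz (assumed)
x = np.zeros(sr * 8)
x[:: sr // 2] = 1.0        # one click every 0.5 s, i.e. 120 bpm
print(round(estimate_tempo(x, sr)))
```

On real recordings a spectrum-based novelty curve is usually more robust than raw energy, and octave errors (half or double tempo) are a known pitfall of autocorrelation-based estimates.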
Fundamentals of Music Signal Processing
• Timbre: difference in time-frequency distribution
‒ odd-to-even harmonic ratio, decay rate, vibrato, etc.
(figure panels: piano solo, human voice)
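As an illustration of one timbre descriptor named above, the sketch below computes an odd-to-even harmonic energy ratio for two synthetic tones with the same pitch but different harmonic content. The known `f0`, the number of harmonics and the synthetic waveforms are assumptions; real signals would need a pitch estimate first.

```python
import numpy as np

def odd_even_ratio(x, sr, f0, n_harmonics=10):
    """Energy in odd harmonics (1st, 3rd, ...) divided by energy in even ones."""
    spectrum = np.abs(np.fft.rfft(x)) ** 2
    bin_hz = sr / len(x)
    harmonic_energy = [spectrum[int(round(h * f0 / bin_hz))] for h in range(1, n_harmonics + 1)]
    odd = sum(harmonic_energy[0::2])    # harmonics 1, 3, 5, ...
    even = sum(harmonic_energy[1::2])   # harmonics 2, 4, 6, ...
    return odd / (even + 1e-12)

sr, f0 = 8000, 200.0                    # sampling rate and pitch in Hz (assumed)
t = np.arange(sr) / sr
k = np.arange(1, 11)[:, None]
sawtooth = np.sum(np.sin(2 * np.pi * k * f0 * t) / k, axis=0)           # all harmonics
square = np.sum(np.sin(2 * np.pi * k[::2] * f0 * t) / k[::2], axis=0)   # odd harmonics only
print(odd_even_ratio(sawtooth, sr, f0), odd_even_ratio(square, sr, f0))
```

Both tones have the same pitch, yet the ratio differs by many orders of magnitude, which is the sense in which timbre is a difference in how energy is distributed over frequency.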
Fundamentals of Music Signal Processing
• Spectrogram, or its reduced-dimension version, the “Mel-spectrogram,” is usually considered a “raw” feature representation of music
• Can be treated as an image and then processed by convolutional neural nets (CNN)
(figure made by Sander Dieleman, http://benanne.github.io/2014/08/05/spotify-cnns.html)
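A sketch of the dimension reduction behind a Mel-spectrogram: a bank of triangular filters, equally spaced on the mel scale, is applied to each spectrogram frame. The mel formula used here is one common convention, and the sizes (`n_fft`, `n_mels`) and the random stand-in spectrogram are assumptions.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters equally spaced on the mel scale; shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.fft.rfftfreq(n_fft, d=1.0 / sr)
    edges = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2), n_mels + 2))
    fb = np.zeros((n_mels, len(fft_freqs)))
    for m in range(n_mels):
        left, centre, right = edges[m], edges[m + 1], edges[m + 2]
        rising = (fft_freqs - left) / (centre - left)
        falling = (right - fft_freqs) / (right - centre)
        fb[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb

sr, n_fft, n_mels = 8000, 1024, 40
power_spec = np.random.default_rng(0).random((n_fft // 2 + 1, 100))  # stand-in spectrogram
mel_spec = mel_filterbank(sr, n_fft, n_mels) @ power_spec
print(power_spec.shape, "->", mel_spec.shape)
```

The resulting 2-D array (mel bands by time frames, often log-compressed) is what would be fed to a CNN as a single-channel image.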
Fundamentals of Music Signal Processing
• Chromagram: a better “timbre-invariant” feature representation for pitch-related tasks (e.g. chord recognition, cover song identification)
‒ merge all the frequency bins with the same note name (C, C#, D, D#, …)
‒ 12-dim vector for each time frame
(figure made by Meinard Müller)
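The bin-merging step can be sketched as follows: each FFT bin is assigned to its nearest equal-tempered note, and bins sharing a note name are summed into one of 12 pitch classes. The lower cutoff `fmin`, the A4 = 440 Hz tuning and the synthetic input are assumptions for illustration.

```python
import numpy as np

def chroma_from_spectrum(mag, sr, n_fft, fmin=55.0):
    """Fold magnitude-spectrogram bins into 12 pitch classes (index 0 = C)."""
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sr)
    valid = freqs >= fmin                                    # skip DC and very low bins
    midi = np.round(69 + 12 * np.log2(freqs[valid] / 440.0)).astype(int)
    chroma = np.zeros((12, mag.shape[1]))
    np.add.at(chroma, midi % 12, mag[valid])                 # sum bins sharing a note name
    return chroma

sr, n_fft = 8000, 4096
t = np.arange(n_fft) / sr
a_octaves = sum(np.sin(2 * np.pi * f * t) for f in (220.0, 440.0, 880.0))  # A3, A4, A5
mag = np.abs(np.fft.rfft(a_octaves * np.hanning(n_fft)))[:, None]          # one time frame
chroma = chroma_from_spectrum(mag, sr, n_fft)
print(chroma[:, 0].argmax())    # index of the pitch class with most energy
```

The three octaves of A collapse into a single pitch class (index 9), so the octave and much of the spectral envelope no longer affect the feature.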
Fundamentals of Music Signal Processing
• Source separation can sometimes be helpful
‒ harmonic/percussive separation: given a mixture, separate the percussive part from the harmonic part
‒ harmonic: pitch-related info
‒ percussive: tempo-related info
(figure panels: (a) original, (b) harmonic, (c) percussive)
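One commonly used way to do harmonic/percussive separation is median filtering of the magnitude spectrogram; the slides do not say which method produced the figure, so this is only an illustrative sketch. Sustained tones form horizontal ridges and drum hits form vertical ridges, so a median filter along time keeps the former and one along frequency keeps the latter. The filter length and the toy spectrogram are assumptions.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def median_filter_1d(S, size, axis):
    """Running median of odd length `size` along one axis, with edge padding."""
    pad = [(0, 0), (0, 0)]
    pad[axis] = (size // 2, size // 2)
    windows = sliding_window_view(np.pad(S, pad, mode="edge"), size, axis=axis)
    return np.median(windows, axis=-1)

def hpss_masks(S, size=17):
    """Soft masks for a magnitude spectrogram S of shape (freq, time)."""
    harm = median_filter_1d(S, size, axis=1)   # smooth along time: keeps horizontal ridges
    perc = median_filter_1d(S, size, axis=0)   # smooth along frequency: keeps vertical ridges
    total = harm + perc + 1e-12
    return harm / total, perc / total

S = np.full((64, 64), 0.01)
S[20, :] = 1.0      # a sustained tone: horizontal line
S[:, 40] = 1.0      # a drum hit: vertical line
mask_h, mask_p = hpss_masks(S)
print(mask_h[20, 10] > 0.9, mask_p[10, 40] > 0.9)
```

Multiplying each mask with the complex STFT of the mixture and inverting would give the two separated signals.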
Fundamentals of Music Signal Processing
• Source separation can sometimes be helpful
‒ singing voice separation: given a mixture, separate the singing voice from the accompaniment
Fundamentals of Music Signal Processing
• Pitch, tempo, timbre play different roles in different tasks
• Spectrogram: a basic feature representation
• Multipitch estimation: for better pitch info
• Source separation: might improve the extraction of pitch, tempo and also timbre
• Feature design (based on domain knowledge) versus feature learning (data-driven; deep learning)
Outline
• Types of music related research
• Fundamentals of music signal processing
• New opportunities in the big data era
New Opportunities in the Big Data Era
• Big music audio data? No, unless you work for a big company
─ not sharable due to copyright issues and business interests
─ however, audio features can be shared
─ or, start with copyright-free music
(figure: Free Music Archive)
New Opportunities in the Big Data Era
• Big music listening data? Yes, some of it can be crawled from social platform websites
‒ from last.fm API, EchoNest API
‒ from Twitter: #nowplaying dataset
New Opportunities in the Big Data Era
• Big music text data? Yes, plenty of data
─ scores, lyrics, reviews, playlists, tags, Wikipedia, etc.
─ not everything is in audio
─ some information is easier to get from non-audio data
New Opportunities in the Big Data Era
• Big sensor data? Yes, everywhere
─ sensors attached to “things” or “human beings”
─ emerging new applications
1) music generation
2) context-aware music recommendation
(figures from pinterest and ask.audio)
New Opportunities in the Big Data Era
• The missing “D” in Data Science: domain knowledge
• Music information retrieval = musicology + signal processing + machine learning + others
Postlude
• Further reading
‒ International Society for Music Information Retrieval Conference (ISMIR)
‒ IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
‒ MIREX (Music Information Retrieval Evaluation eXchange)
‒ IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP)
‒ IEEE Transactions on Multimedia (TMM)

楊奕軒/音樂資料檢索

  • 1. Music Information Retrieval Music & Audio Computing Lab, Research Center for IT Innovation, Academia Sinica Yi-Hsuan Yang Ph.D. http://www.citi.sinica.edu.tw/pages/yang/ yang@citi.sinica.edu.tw
  • 2. Prelude • PI @ Music & Audio Computing Lab, Academia Sinica, since 2011 • 10420CS 573100 “Music Information Retrieval” @ NTHU, 2016 https://twtmir.wordpress.com/ 2 https://teachingmir.wikispaces.com/courses
  • 3. Outline • Types of music related research • Fundamentals of music signal processing • New opportunities in the big data era 3
  • 4. Types of Music Related Research 1. Music creation 4 https://www.youtube.com/watch?v=3OEmzI52stk
  • 5. Types of Music Related Research 1. Music creation 5 https://www.youtube.com/watch?v=k1DgNfz1g_s
  • 6. Types of Music Related Research 1. Music creation 6 https://www.youtube.com/watch?v=wj1r9YJ6INA
  • 7. Types of Music Related Research 1. Music creation 7 http://www.inside.com.tw/2016/05/04/positive-grid-bias-head
  • 8. Types of Music Related Research 1. Music creation 8 https://youtu.be/rL5YKZ9ecpg?t=50m
  • 9. Types of Music Related Research 2. Music information “analysis” 9 automatic page turner automatic Karaoke scoring interactive concert
  • 10. Types of Music Related Research 2. Music information “analysis” 10 chord recognizer music browsing assistant
  • 11. Types of Music Related Research 3. Music information “retrieval” • Search ‒ through keywords/labels (genre, instrument, emotion)
  • 12. Types of Music Related Research 3. Music information “retrieval” • Search ‒ through keywords/labels (genre, instrument, emotion) ‒ through audio examples (humming, audio recording)
  • 13. Types of Music Related Research 3. Music information “retrieval” • Match ‒ to match 1) a video clip, 2) a photo slideshow, 3) song lyrics, or 4) a given context ‒ cross-domain retrieval
  • 14. Types of Music Related Research 3. Music information “retrieval” • Discover ‒ recommendation: diversity, serendipity, explanations
  • 15. Types of Music Related Research 3. Music information “retrieval” • Discover ‒ recommendation: diversity, serendipity, explanations
  • 16. Types of Music Related Research 1. Music creation • Google Magenta, Smule AutoRap, Samsung Hum-On, Positive Grid, Yamaha Vocaloid 2. Music information analysis • Education, data visualization 3. Music information retrieval • Search: through keywords (genre, instrument, emotion) or audio examples (humming or audio recording) • Match: cross-domain retrieval • Discover: recommendation
  • 17. Outline • Types of music related research • Fundamentals of music signal processing • New opportunities in the big data era
  • 18. Fundamentals of Music Signal Processing • Pitch: which notes are played? • Tempo: how fast? • Timbre: which instrument(s)? • Example: Mozart’s Variationen (1st phrase)
  • 19. Fundamentals of Music Signal Processing • Pitch ♪♪♪ ♪♪♪ ♪♪♪ • Tempo ♪ ♪ ♪ • Timbre ♪ ♪♪ ♪ • Applications: karaoke scorer, chord recognizer, page turner
  • 20. Fundamentals of Music Signal Processing • Pitch ♪♪♪ ♪ • Tempo ♪♪♪ • Timbre ♪♪♪ ♪ • Applications: instrument classifier, content ID, Spotify running
  • 21. Fundamentals of Music Signal Processing • Pitch ♪♪♪ ♪♪♪ ♪♪♪ • Tempo ♪♪♪ ♪♪♪ ♪♪♪ • Timbre ♪♪♪ ♪♪♪ ♪♪♪ • Applications: similarity search or recommendation, music emotion or genre recognizer, automatic music video generation
  • 22. Fundamentals of Music Signal Processing • Listens to music: tempo, instrumentation, key, time signature, energy, harmonic & timbral structures • Reads about music: lyrics, blog posts, reviews, playlists and discussion forums • Learns about trends: online music behavior (who's talking about which artists this week, what songs are being streamed or downloaded) • Not everything is in audio
  • 23. Fundamentals of Music Signal Processing • Let’s have a look at what we can extract from audio anyway • Time-domain waveform
  • 24. Fundamentals of Music Signal Processing • Frequency domain representation • Spectrogram (obtained by Short-Time Fourier Transform)
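The spectrogram mentioned on this slide can be sketched in a few lines of NumPy. This is only a minimal illustration under assumed settings: the frame size, hop size, Hann window, and the 440 Hz test tone are not from the slides, and `stft_magnitude` is a hypothetical helper name.

```python
import numpy as np

def stft_magnitude(signal, frame_size=1024, hop_size=256):
    """Magnitude spectrogram with shape (num_frames, frame_size // 2 + 1)."""
    window = np.hanning(frame_size)
    num_frames = 1 + (len(signal) - frame_size) // hop_size
    # Cut the signal into overlapping frames and taper each one.
    frames = np.stack([
        signal[i * hop_size:i * hop_size + frame_size] * window
        for i in range(num_frames)
    ])
    # One FFT per frame; rfft keeps only the non-negative frequencies.
    return np.abs(np.fft.rfft(frames, axis=1))

# Assumed test signal: one second of a 440 Hz sine sampled at 8 kHz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)

spectrogram = stft_magnitude(tone)
bin_freqs = np.fft.rfftfreq(1024, d=1.0 / sample_rate)
peak_hz = bin_freqs[spectrogram.mean(axis=0).argmax()]
print(spectrogram.shape, peak_hz)
```

The strongest bin lands within one bin width (about 7.8 Hz here) of the true tone frequency; a longer frame gives finer frequency resolution at the cost of coarser time resolution.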
  • 25. Fundamentals of Music Signal Processing • Pitch • Simple for monophonic signals (almost table lookup) • Challenging for polyphonic signals; known as multi-pitch estimation (MPE) ‒ overlapping partials ‒ missing fundamentals
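For the monophonic case, the "almost table lookup" idea can be sketched as follows: estimate one fundamental frequency per frame, then map it to the nearest note name. The autocorrelation estimator is a swapped-in, commonly used technique and not necessarily the one the slides have in mind; the function names, the search range, and the test tone are assumptions. This sketch does not address the polyphonic (MPE) case.

```python
import numpy as np

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

def estimate_f0(frame, sample_rate, fmin=80.0, fmax=1000.0):
    """Estimate the fundamental frequency from the autocorrelation peak."""
    frame = frame - frame.mean()
    # Keep non-negative lags only.
    acf = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    lag = min_lag + np.argmax(acf[min_lag:max_lag + 1])
    return sample_rate / lag

def hz_to_note(freq_hz):
    """Map a frequency to the nearest equal-tempered note (A4 = 440 Hz)."""
    midi = int(round(69 + 12 * np.log2(freq_hz / 440.0)))
    return f"{NOTE_NAMES[midi % 12]}{midi // 12 - 1}"

# Assumed test tone: 220 Hz with two weaker overtones, sampled at 16 kHz.
sample_rate = 16000
t = np.arange(2048) / sample_rate
tone = sum(np.sin(2 * np.pi * 220.0 * h * t) / h for h in (1, 2, 3))

f0 = estimate_f0(tone, sample_rate)
note = hz_to_note(f0)
print(round(f0, 1), note)
```

Because the lag is an integer number of samples, the estimate is quantized; interpolating around the autocorrelation peak would refine it.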
  • 26. Fundamentals of Music Signal Processing • Tempo: beats per minute (bpm) • Onset detection, downbeat estimation, tempo estimation, beat tracking, rhythm pattern extraction (figure panels: energy-based, spectrum-based)
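A spectrum-based onset curve and a simple tempo estimate can be sketched as below. Spectral flux plus autocorrelation is one common recipe, used here as an assumption rather than the method behind the slide's figure; the function names, the bpm search range, and the synthetic click track are all hypothetical.

```python
import numpy as np

def onset_envelope(signal, frame_size=400, hop_size=200):
    """Spectral flux: summed positive magnitude increase between frames."""
    window = np.hanning(frame_size)
    num_frames = 1 + (len(signal) - frame_size) // hop_size
    frames = np.stack([
        signal[i * hop_size:i * hop_size + frame_size] * window
        for i in range(num_frames)
    ])
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Only increases count: decays should not register as onsets.
    return np.maximum(np.diff(magnitude, axis=0), 0.0).sum(axis=1)

def estimate_tempo(envelope, frame_rate, bpm_min=60.0, bpm_max=200.0):
    """Pick the autocorrelation peak of the onset curve within a bpm range."""
    env = envelope - envelope.mean()
    acf = np.correlate(env, env, mode="full")[len(env) - 1:]
    min_lag = int(round(60.0 * frame_rate / bpm_max))
    max_lag = int(round(60.0 * frame_rate / bpm_min))
    lag = min_lag + np.argmax(acf[min_lag:max_lag + 1])
    return 60.0 * frame_rate / lag

# Assumed test signal: 8 seconds of decaying clicks every 0.5 s (120 bpm).
sample_rate, hop_size = 8000, 200
signal = np.zeros(8 * sample_rate)
click_t = np.arange(400) / sample_rate
click = np.exp(-click_t / 0.01) * np.sin(2 * np.pi * 1000.0 * click_t)
for start in range(0, len(signal) - len(click), sample_rate // 2):
    signal[start:start + len(click)] += click

envelope = onset_envelope(signal, frame_size=400, hop_size=hop_size)
tempo = estimate_tempo(envelope, frame_rate=sample_rate / hop_size)
print(tempo)
```

Real recordings often show peaks at half or double the perceived tempo, which is why the search range is bounded.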
  • 27. Fundamentals of Music Signal Processing • Timbre: difference in time-frequency distribution
  • 28. Fundamentals of Music Signal Processing • Timbre: difference in time-frequency distribution ‒ odd-to-even harmonic ratio, decay rate, vibrato, etc. (figure panels: piano solo, human voice)
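As one illustration of the timbre descriptors listed here, the odd-to-even harmonic ratio can be approximated by reading the spectrum at multiples of a known fundamental. This is a rough sketch: it assumes the fundamental `f0` is already known, and the function name, harmonic count, and synthetic tones are hypothetical.

```python
import numpy as np

def odd_to_even_ratio(frame, sample_rate, f0, num_harmonics=10):
    """Energy of odd harmonics divided by energy of even harmonics."""
    magnitude = np.abs(np.fft.rfft(frame * np.hanning(len(frame))))
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    # Read the energy at the bin closest to each harmonic of f0.
    energy = np.array([
        magnitude[np.argmin(np.abs(freqs - h * f0))] ** 2
        for h in range(1, num_harmonics + 1)
    ])
    odd = energy[0::2].sum()   # harmonics 1, 3, 5, ...
    even = energy[1::2].sum()  # harmonics 2, 4, 6, ...
    return odd / (even + 1e-12)

# Assumed test tones: all harmonics (sawtooth-like) vs. odd only (square-like).
sample_rate, f0 = 16000, 200.0
t = np.arange(4096) / sample_rate
sawtooth_like = sum(np.sin(2 * np.pi * f0 * h * t) / h for h in range(1, 11))
square_like = sum(np.sin(2 * np.pi * f0 * h * t) / h for h in range(1, 11, 2))

saw_ratio = odd_to_even_ratio(sawtooth_like, sample_rate, f0)
square_ratio = odd_to_even_ratio(square_like, sample_rate, f0)
print(saw_ratio, square_ratio)
```

The tone with only odd harmonics yields a far larger ratio, which is the kind of contrast such a descriptor is meant to capture.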
  • 29. Fundamentals of Music Signal Processing • Spectrogram, or the reduced-dimension version “Mel-spectrogram,” is usually considered as a “raw” feature representation of music • Can be treated as an image and then processed by convolutional neural nets (CNN) • Figure made by Sander Dieleman, http://benanne.github.io/2014/08/05/spotify-cnns.html
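The dimension reduction from a spectrogram to a Mel-spectrogram amounts to multiplying by a bank of triangular filters spaced evenly on the mel scale. The sketch below uses one widely used mel formula (2595 log10(1 + f/700)); the filter count, FFT size, sample rate, and the random stand-in spectrogram are assumptions for illustration only.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(num_filters, fft_size, sample_rate):
    """Triangular filters with shape (num_filters, fft_size // 2 + 1)."""
    fft_freqs = np.fft.rfftfreq(fft_size, d=1.0 / sample_rate)
    # Filter edges are equally spaced in mel, hence denser at low frequencies.
    mel_edges = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                            num_filters + 2)
    hz_edges = mel_to_hz(mel_edges)
    bank = np.zeros((num_filters, len(fft_freqs)))
    for m in range(num_filters):
        low, center, high = hz_edges[m:m + 3]
        rising = (fft_freqs - low) / (center - low)
        falling = (high - fft_freqs) / (high - center)
        bank[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

bank = mel_filterbank(num_filters=40, fft_size=1024, sample_rate=16000)

# Stand-in magnitude spectrogram (10 frames); real input would come from an STFT.
rng = np.random.default_rng(0)
spectrogram = rng.random((10, 513))
log_mel = np.log1p(spectrogram @ bank.T)
print(bank.shape, log_mel.shape)
```

Each frame shrinks from 513 frequency bins to 40 mel bands, and the resulting 2-D array is what would be fed to a CNN as an image-like input.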
  • 30. Fundamentals of Music Signal Processing • Chromagram: a better “timbre-invariant” feature representation for pitch-related tasks (e.g. chord recognition, cover song identification) ‒ merge all the frequency bins with the same note name (C, C#, D, D#, …) ‒ 12-dim vector for each time frame • Figure made by Meinard Müller
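The bin-merging step described on this slide can be sketched directly: assign each FFT bin to its nearest equal-tempered note, then sum magnitudes per note name. The frequency range, the A4 = 440 Hz tuning, the function name, and the C major test chord are assumptions of this sketch.

```python
import numpy as np

def chroma_from_spectrum(magnitudes, sample_rate, fft_size,
                         fmin=55.0, fmax=2000.0):
    """Fold one magnitude spectrum into a 12-dim vector (index 0 = C)."""
    freqs = np.fft.rfftfreq(fft_size, d=1.0 / sample_rate)
    valid = (freqs >= fmin) & (freqs <= fmax)
    # Nearest MIDI note number per bin; note name is the number modulo 12.
    midi = np.round(69 + 12 * np.log2(freqs[valid] / 440.0)).astype(int)
    chroma = np.zeros(12)
    np.add.at(chroma, midi % 12, magnitudes[valid])
    total = chroma.sum()
    return chroma / total if total > 0 else chroma

# Assumed test frame: a C major triad (C4, E4, G4) sampled at 16 kHz.
sample_rate, fft_size = 16000, 4096
t = np.arange(fft_size) / sample_rate
chord = sum(np.sin(2 * np.pi * f * t) for f in (261.63, 329.63, 392.00))
magnitudes = np.abs(np.fft.rfft(chord * np.hanning(fft_size)))

chroma = chroma_from_spectrum(magnitudes, sample_rate, fft_size)
strongest = sorted(np.argsort(chroma)[-3:].tolist())
print(strongest)
```

The three strongest entries correspond to the note names C, E, and G regardless of octave, which is what makes the representation useful for chord recognition.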
  • 31. Fundamentals of Music Signal Processing • Source separation can sometimes be helpful ‒ harmonic/percussive separation: given a mixture, separate the percussive part from the harmonic part ‒ harmonic: pitch related info ‒ percussive: tempo related info (figure panels: (a) original, (b) harmonic, (c) percussive)
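One well-known way to do harmonic/percussive separation is median filtering of the spectrogram: harmonic content forms ridges that are stable over time, while percussive content forms ridges that span many frequencies at one instant. The slides do not say which method they illustrate, so this is a sketch of that particular technique; the function names, filter size, and the toy spectrogram are assumptions.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def median_filter_1d(matrix, size, axis):
    """Median filter along one axis with edge padding (size must be odd)."""
    pad = size // 2
    pad_width = [(0, 0), (0, 0)]
    pad_width[axis] = (pad, pad)
    padded = np.pad(matrix, pad_width, mode="edge")
    windows = sliding_window_view(padded, size, axis=axis)
    return np.median(windows, axis=-1)

def hpss_masks(spectrogram, size=17, eps=1e-10):
    """Soft masks for a magnitude spectrogram of shape (frames, bins)."""
    harmonic = median_filter_1d(spectrogram, size, axis=0)    # smooth over time
    percussive = median_filter_1d(spectrogram, size, axis=1)  # smooth over frequency
    total = harmonic ** 2 + percussive ** 2 + eps
    return harmonic ** 2 / total, percussive ** 2 / total

# Toy spectrogram: a sustained tone (bin 10) plus one drum hit (frame 20).
spectrogram = np.full((40, 60), 0.01)
spectrogram[:, 10] = 1.0
spectrogram[20, :] = 1.0

harmonic_mask, percussive_mask = hpss_masks(spectrogram)
harmonic_part = spectrogram * harmonic_mask
percussive_part = spectrogram * percussive_mask
print(harmonic_mask[5, 10], percussive_mask[20, 30])
```

Multiplying the masks with the complex STFT and inverting it would give the two separated audio signals; that step is omitted here.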
  • 32. Fundamentals of Music Signal Processing • Source separation can sometimes be helpful ‒ singing voice separation: given a mixture, separate the singing voice from the accompaniment
  • 33. Fundamentals of Music Signal Processing • Pitch, tempo, timbre play different roles in different tasks • Spectrogram: a basic feature representation • Multipitch estimation: for better pitch info • Source separation: might improve the extraction of pitch, tempo and also timbre • Feature design (based on domain knowledge) versus feature learning (data-driven; deep learning)
  • 34. Outline • Types of music related research • Fundamentals of music signal processing • New opportunities in the big data era
  • 35. New Opportunities in the Big Data Era • Big music audio data? No, unless you work for a big company ─ not sharable due to copyright issues and business interests ─ however, audio features can be shared ─ or, start with copyright-free music (e.g. Free Music Archive)
  • 36. New Opportunities in the Big Data Era • Big music listening data? Yes, some of it can be crawled from social platform websites ‒ from last.fm API, EchoNest API ‒ from Twitter: #nowplaying dataset
  • 37. New Opportunities in the Big Data Era • Big music text data? Yes, plenty of data ─ score, lyrics, review, playlist, tags, Wikipedia, etc. ─ not everything is in audio ─ some information is easier to get from non-audio data
  • 38. New Opportunities in the Big Data Era • Big sensor data? Yes, everywhere ─ sensors attached to “things” or “human beings” ─ emerging new applications 1) music generation 2) context-aware music recommendation (figures from Pinterest and ask.audio)
  • 39. New Opportunities in the Big Data Era • The missing “D” in Data Science: domain knowledge • Music information retrieval = musicology + signal processing + machine learning + others
  • 40. Postlude • Further reading ‒ International Society for Music Information Retrieval Conference (ISMIR) ‒ IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ‒ MIREX (MIR Evaluation eXchange) ‒ IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP) ‒ IEEE Transactions on Multimedia (TMM)