JOURNAL ARTICLE

Mining Large-Scale Music Data Sets

Daniel P. W. EllisThierry Bertin-Mahieux

Year: 2012 Journal:   Columbia Academic Commons (Columbia University)   Publisher: Columbia University

Abstract

Large collections of music audio are now common and present an interesting research opportunity: what statistical patterns and structure can be discovered across thousands or millions of examples? Unfortunately, copyright restrictions can interfere with access to such collections, so we have developed the Million Song Dataset, including derived features but not the original audio, to support commercial-scale music analysis on a common, research database. The audio features are augmented by a wide range of metadata including lyrics, tags, and listener playcounts. Now the database is ready, we have begun analyzing the content, including tasks such as identifying cover songs -- significantly harder for such a large collection.

Keywords:
Scale (ratio) Computer science Data set Data mining Artificial intelligence Geography Cartography

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.03
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Diverse Musicological Studies
Social Sciences →  Arts and Humanities →  Music
Music Technology and Sound Studies
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Mining Frequent Item sets on Large Scale Temporal Data

S. HaseenaS. ManoruthraP. HemalathaV. Akshaya

Journal:   2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA) Year: 2018 Vol: 1 Pages: 811-815
BOOK-CHAPTER

Visual Data Mining of Large Spatial Data Sets

Daniel A. KeimChristian PanseMike Sips

Lecture notes in computer science Year: 2003 Pages: 201-215
© 2026 ScienceGate Book Chapters — All rights reserved.