librosa.load — librosa 0.10.2 documentation

link管理

链接快照平台

输入网页链接，自动生成快照
标签化管理网页链接

相关文章推荐

爱吹牛的小马驹 · 关于聘任新一届天津市规划委员会规划设计委员会 ...· 1 月前 ·

发呆的抽屉 · linux使用make命令安装文件夹 • ...· 3 月前 ·

考研的麦片 · 高质量开新局！大庆油田首季完成油气产量当量1 ...· 5 月前 ·

腼腆的绿豆 · 什么是套接字？Socket基本介绍-CSDN博客· 6 月前 ·

无邪的八宝粥 · 求教,这个C++问题应该怎么解决?可能是关于 ...· 8 月前 ·

librosa. load ( path , * , sr=22050 , mono=True , offset=0.0 , duration=None , dtype=<class 'numpy.float32'> , res_type='soxr_hq' ) [source] 

Load an audio file as a floating point time series.

Audio will be automatically resampled to the given rate (default sr=22050 ).

To preserve the native sampling rate of the file, use sr=None .

Parameters :

path string, int, pathlib.Path, soundfile.SoundFile, audioread object, or file-like object

path to the input file.

Any codec supported by soundfile or audioread will work.

Any string file paths, or any object implementing Python’s file interface (e.g. pathlib.Path ) are supported as path .

If the codec is supported by soundfile , then path can also be an open file descriptor (int) or an existing soundfile.SoundFile object.

Pre-constructed audioread decoders are also supported here, see the example below. This can be used, for example, to force a specific decoder rather than relying upon audioread to select one for you.

Warning

audioread support is deprecated as of version 0.10.0. audioread support be removed in version 1.0.

sr number > 0 [scalar]

target sampling rate

‘None’ uses the native sampling rate

mono bool

convert signal to mono

offset float

start reading after this time (in seconds)

duration float

only load up to this much audio (in seconds)

dtype numeric type

data type of y

res_type str

resample type (see note)

By default, this uses soxr ’s high-quality mode (‘HQ’).

For alternative resampling modes, see resample

audioread may truncate the precision of the audio data to 16 bits.

See Advanced I/O Use Cases for alternate loading methods.

Returns :

y np.ndarray [shape=(n,) or (…, n)]

audio time series. Multi-channel is supported.

sr number > 0 [scalar]

sampling rate of y

     >>> # Load an ogg vorbis file>>> filename=librosa.ex('trumpet')>>> y,sr=librosa.load(filename)array([-1.407e-03, -4.461e-04, ..., -3.042e-05,  1.277e-05],      dtype=float32)22050>>> # Load a file and resample to 11 KHz>>> filename=librosa.ex('trumpet')>>> y,sr=librosa.load(filename,sr=11025)array([-8.746e-04, -3.363e-04, ..., -1.301e-05,  0.000e+00],      dtype=float32)11025>>> # Load 5 seconds of a file, starting 15 seconds in>>> filename=librosa.ex('brahms')>>> y,sr=librosa.load(filename,offset=15.0,duration=5.0)array([0.146, 0.144, ..., 0.128, 0.015], dtype=float32)22050>>> # Load using an already open SoundFile object>>> importsoundfile>>> sfo=soundfile.SoundFile(librosa.ex('brahms'))>>> y,sr=librosa.load(sfo)>>> # Load using an already open audioread object
>>> import audioread.ffdec  # Use ffmpeg decoder
>>> aro = audioread.ffdec.FFmpegAudioFile(librosa.ex('brahms'))
>>> y, sr = librosa.load(aro)