Missing documentation - how to understand output of aubio pitch and aubio notes? #364

Open
mepc36 opened this issue May 23, 2022 · 2 comments

mepc36 commented May 23, 2022

Apologies for asking a question that is probably answered somewhere in the docs, but between these sites I cannot find the answer:

https://aubio.org/manual/latest/cli.html#aubio
https://aubio.org/documentation
https://github.com/aubio/aubio/issues/new

I do not understand the output of the aubio pitch and aubio notes commands.

audio file: "I Love Kanye", by Kanye West (on YouTube here)
commands: aubio pitch i-love-kanye.mp3 and aubio notes i-love-kanye.mp3

aubio pitch sample output:

(base) martin:Desktop martinconnor$ aubio pitch 09.\ I\ Love\ Kanye\ copy.mp3 
0.000000	0.000000
0.005805	0.000000
0.011610	2255.677002
0.017415	1234.997070
0.023220	771.823730
0.029025	469.961548
0.034830	2127.355225
0.040635	2088.092529
0.046440	909.061890
0.052245	908.138550
0.058050	467.406403
0.063855	465.747223
0.069660	462.281494
0.075465	459.564240
0.081270	1834.805786
0.087075	1773.861328
0.092880	1745.297363
0.098685	910.913269
0.104490	831.457031
0.110295	829.726196
0.116100	832.404480
0.121905	0.000000
0.127710	829.331848
0.133515	834.354248
0.139320	855.222351
0.145125	0.000000
0.150930	321.989532
0.156735	70.879410
0.162540	1073.676270
0.168345	78.648453
0.174150	78.694801
0.179955	144.124207
0.185760	143.980957
0.191565	169.961380
0.197370	202.052338
0.203175	201.367661
0.208980	703.161194
0.214785	229.030853
0.220590	232.336227
0.226395	236.057510
0.232200	241.617416
0.238005	251.036484
0.243810	256.490662
0.249615	262.420593
0.255420	271.795532
0.261224	274.221069
0.267029	275.867157
0.272834	276.927521
0.278639	277.184906
0.284444	276.553833
0.290249	275.070312
0.296054	272.945099
0.301859	270.738770
0.307664	268.701660
0.313469	266.627899
0.319274	264.268524
0.325079	262.173401
0.330884	260.317780
0.336689	255.506668
0.342494	253.204224
0.348299	251.611511
0.354104	249.555466
0.359909	246.736282
0.365714	242.597534
0.371519	238.464157
0.377324	235.591522
0.383129	232.148575
0.388934	228.528168
0.394739	226.172043
0.400544	224.810959
0.406349	224.243057
0.412154	223.427597
0.417959	214.134247
0.423764	213.898743
0.429569	6259.341797
0.435374	6553.820312
0.441179	6790.392578
0.446984	6668.113770
0.452789	6484.388184
0.458594	6449.885742
0.464399	6589.834961
0.470204	6862.158203
0.476009	6853.378418
0.481814	6576.416504
0.487619	6399.205566
0.493424	6438.535645
0.499229	6269.137207
0.505034	6021.243652
0.510839	5944.690918
0.516644	220.340805
0.522449	232.385193
0.528254	232.202774
0.534059	5880.576660
0.539864	216.217987
0.545669	297.520325
0.551474	377.006073
0.557279	174.714264
0.563084	175.741562
0.568889	530.153931

I do not understand what these two columns of numbers are. The one on the left seems to be timestamps for the whole song, but what is the one on the right?

This is the output for aubio notes:

(base) martin:Desktop martinconnor$ aubio notes 09.\ I\ Love\ Kanye\ copy.mp3 
0.075465	
70.000000	0.075465	0.116100	
50.000000	0.197370	0.371519	
59.000000	0.371519	0.406349	
116.000000	0.458594	0.464399	
72.000000	0.586304	0.609524	
55.000000	0.795283	0.864943	
57.000000	0.864943	0.998458	
90.000000	1.126168	1.375782	
51.000000	1.375782	1.428027	
71.000000	1.428027	1.480272	
69.000000	1.480272	1.503492	
99.000000	1.596372	1.607982	
112.000000	1.712472	1.747302	
111.000000	1.747302	1.857596	
57.000000	1.933061	1.985306	
57.000000	1.985306	2.060771	
52.000000	2.118821	2.153651	
52.000000	2.165261	2.316190	
54.000000	2.316190	2.351020	
55.000000	2.351020	2.536780	
92.000000	2.681905	2.948934	
53.000000	2.948934	3.070839	
98.000000	3.163719	3.192744	
111.000000	3.314649	3.512018	
58.000000	3.512018	3.541043	
51.000000	3.686168	3.755828	
113.000000	3.755828	3.877732	
114.000000	3.877732	4.150567	
91.000000	4.237642	4.324717	
54.000000	4.324717	4.423401	
53.000000	4.516281	4.556916	
52.000000	4.556916	4.620771	
99.000000	4.707846	4.736871	
119.000000	4.835556	4.928435	
59.000000	4.928435	5.044535	
56.000000	5.044535	5.207075	
53.000000	5.207075	5.270930	
51.000000	5.270930	5.305760	
55.000000	5.421859	5.630839	
112.000000	5.688889	5.781769	
55.000000	5.781769	6.019773	
53.000000	6.019773	6.054603	
53.000000	6.054603	6.112653	
98.000000	6.211338	6.269388	
70.000000	6.356463	6.408707	
70.000000	6.408707	6.466757	
60.000000	6.466757	6.617687	
54.000000	6.762812	6.948571	
55.000000	6.948571	6.983401	
57.000000	6.983401	7.128526	
82.000000	7.279456	7.575510	
67.000000	7.575510	7.680000	
99.000000	7.767075	7.819320	
53.000000	7.912200	8.034104	
57.000000	8.092154	8.405624	
53.000000	8.405624	8.498503	
58.000000	8.498503	8.533333	
60.000000	8.533333	8.736508	
55.000000	8.777143	8.840998	
97.000000	8.840998	9.119637	
54.000000	9.119637	9.218322	
99.000000	9.305397	9.351837	
56.000000	9.438912	9.508571	
84.000000	9.601451	9.717551	
58.000000	9.793016	9.827846	
68.000000	9.827846	9.868481	
113.000000	9.903311	10.077460	
60.000000	10.077460	10.280635	
54.000000	10.303855	10.367710	
81.000000	10.367710	10.623129	
54.000000	10.623129	10.657959	
54.000000	10.657959	10.716009	
99.000000	10.808889	10.837914	
113.000000	10.936599	11.041088	
60.000000	11.041088	11.110748	
57.000000	11.162993	11.389388	
57.000000	11.389388	11.563537	
60.000000	11.563537	11.807347	
58.000000	11.807347	11.813152	
96.000000	11.917642	12.155646	
56.000000	12.155646	12.248526	
56.000000	12.341406	12.405261	
12.550385	
60.000000	12.550385	12.689705	
61.000000	12.689705	12.724535	
60.000000	12.724535	12.770975	
117.000000	12.770975	12.933515	
116.000000	12.933515	13.165714	
61.000000	13.165714	13.212154	
98.000000	13.392109	13.676553	
56.000000	13.676553	13.746213	
52.000000	13.746213	13.781043	
99.000000	13.868118	13.908753	
112.000000	14.030658	14.100317	
61.000000	14.204807	14.262857	
54.000000	14.425397	14.442812	
59.000000	14.599546	14.704036	
118.000000	14.791111	14.895601	
96.000000	14.895601	15.029116	
54.000000	15.029116	15.162630	
54.000000	15.162630	15.290340	
99.000000	15.371610	15.441270	
79.000000	15.551565	15.574785	
71.000000	15.702494	15.795374	
15.946304	
115.000000	15.946304	16.062404	
59.000000	16.062404	16.172698	
100.000000	16.259773	16.277188	
80.000000	16.364263	16.399093	
80.000000	16.399093	16.462948	
60.000000	16.590658	16.631293	
124.000000	16.747392	16.967982	
60.000000	16.967982	17.002812	
60.000000	17.002812	17.037642	
58.000000	17.142132	17.194376	
56.000000	17.194376	17.205986	
55.000000	17.327891	17.449796	
53.000000	17.449796	17.519456	
58.000000	17.519456	17.554286	
60.000000	17.554286	17.606531	
98.000000	17.780680	18.082540	
54.000000	18.082540	18.192834	
99.000000	18.279909	18.337959	
122.000000	18.448254	18.581769	
71.000000	18.651429	18.726893	
56.000000	18.825578	18.999728	
50.000000	18.999728	19.075193	
59.000000	19.075193	19.237732	
58.000000	19.237732	19.260952	
97.000000	19.382857	19.678912	
53.000000	19.678912	19.760181	
99.000000	19.853061	19.887891	
60.000000	19.980771	20.050431	
60.000000	20.160726	20.201361	
59.000000	20.201361	20.271020	
120.000000	20.271020	20.305850	
71.000000	20.538050	20.729615	
60.000000	20.729615	20.787664	
99.000000	20.886349	21.141769	
54.000000	21.141769	21.240454	
53.000000	21.240454	21.298503	
114.000000	21.321723	21.379773	
98.000000	21.379773	21.408798	
74.000000	21.507483	21.704853	
62.000000	21.704853	21.762902	
71.000000	21.855782	22.041542	
60.000000	22.041542	22.076372	
61.000000	22.076372	22.082177	
60.000000	22.215692	22.250522	
60.000000	22.250522	22.418866	
61.000000	22.418866	22.430476	
59.000000	22.575601	22.627846	
58.000000	22.627846	22.790385	
60.000000	22.790385	22.987755	
60.000000	22.987755	23.132880	
61.000000	23.132880	23.173515	
63.000000	23.173515	23.237370	
117.000000	23.237370	23.550839	
60.000000	23.550839	23.707574	
63.000000	23.707574	23.742404	
98.000000	23.864308	23.899138	
63.000000	23.899138	24.009433	
63.000000	24.113923	24.270658	
120.000000	24.288073	24.363537	
71.000000	24.363537	24.409977	
48.000000	24.491247	24.526077	
79.000000	24.526077	24.682812	
62.000000	24.682812	24.735057	
112.000000	24.735057	24.752472	
54.000000	24.856961	25.083356	
54.000000	25.083356	25.292336	
50.000000	25.367800	25.443265	
55.000000	25.443265	25.698685	
51.000000	25.698685	25.745125	
50.000000	25.745125	25.779955	
64.000000	25.884444	25.954104	
49.000000	26.041179	26.081814	
79.000000	26.081814	26.261769	
61.000000	26.261769	26.308209	
112.000000	26.308209	26.325624	
54.000000	26.406893	26.627483	
56.000000	26.627483	26.807438	
51.000000	26.877098	26.958367	
94.000000	26.958367	27.207982	
52.000000	27.207982	27.289252	
100.000000	27.376327	27.434376	
72.000000	27.515646	27.573696	
58.000000	27.573696	27.614331	
60.000000	27.614331	27.747846	
62.000000	27.747846	27.823311	
61.000000	27.933605	28.177415	
60.000000	28.310930	28.427029	
55.000000	28.427029	28.485079	
62.000000	28.485079	28.554739	
55.000000	28.670839	28.711474	
62.000000	28.711474	28.914649	
58.000000	28.914649	29.059773	
61.000000	29.059773	29.135238	
62.000000	29.135238	29.152653	
62.000000	29.239728	29.350023	
58.000000	29.419683	29.518367	
59.000000	29.611247	29.657687	
60.000000	29.657687	29.797007	
123.000000	29.855057	29.878277	
60.000000	29.982766	30.238186	
57.000000	30.238186	30.290431	
55.000000	30.290431	30.319456	
99.000000	30.429751	30.481995	
72.000000	30.627120	30.789660	
61.000000	30.789660	30.853515	
54.000000	30.969615	31.010249	
98.000000	31.155374	31.277279	
59.000000	31.277279	31.480454	
59.000000	31.480454	31.550113	
99.000000	31.654603	31.677823	
56.000000	31.782313	31.822948	
57.000000	31.822948	31.898413	
58.000000	31.910023	31.962268	
59.000000	31.962268	32.002902	
58.000000	32.002902	32.043537	
118.000000	32.043537	32.362812	
90.000000	32.461497	32.554376	
76.000000	32.554376	32.606621	
74.000000	32.606621	32.670476	
57.000000	32.740136	32.809796	
59.000000	32.809796	32.879456	
70.000000	33.076825	33.326440	
57.000000	33.326440	33.384490	
100.000000	33.483175	33.512200	
59.000000	33.634104	33.767619	
59.000000	33.767619	33.906939	
59.000000	33.906939	33.941769	
116.000000	33.976599	33.994014	
57.000000	34.098503	34.127528	
60.000000	34.295873	34.493243	
71.000000	34.609342	34.870567	
58.000000	34.870567	34.922812	
56.000000	35.015692	35.067937	
62.000000	35.201451	35.230476	
64.000000	35.311746	35.416236	
61.000000	35.543946	35.631020	
60.000000	35.631020	35.723900	
117.000000	35.723900	35.822585	
65.000000	35.822585	36.008345	
94.000000	36.124444	36.600454	
65.000000	36.600454	36.658503	
75.000000	36.757188	36.826848	
67.000000	36.826848	36.861678	
122.000000	36.861678	36.913923	
123.000000	36.913923	37.035828	
60.000000	37.035828	37.157732	
58.000000	37.157732	37.186757	
70.000000	37.331882	37.569887	
60.000000	37.569887	37.604717	
111.000000	37.604717	37.610522	
59.000000	37.720816	37.970431	
58.000000	37.970431	38.074921	
101.000000	38.161995	38.225850	
68.000000	38.307120	38.353560	
68.000000	38.353560	38.429025	
115.000000	38.429025	38.463855	
63.000000	38.568345	38.614785	
60.000000	38.614785	38.876009	
54.000000	38.876009	38.934059	
58.000000	38.934059	39.131429	
97.000000	39.264943	39.537778	
57.000000	39.537778	39.648073	
100.000000	39.735147	39.775782	
58.000000	39.880272	39.915102	
56.000000	39.915102	39.978957	
59.000000	39.978957	40.147302	
61.000000	40.147302	40.292426	
57.000000	40.437551	40.472381	
61.000000	40.472381	40.576871	
61.000000	40.704580	40.774240	
61.000000	40.774240	40.838095	
56.000000	40.838095	40.901950	
57.000000	40.901950	40.954195	
96.000000	41.186395	41.232834	
57.000000	41.232834	41.296689	
57.000000	41.296689	41.517279	
57.000000	41.517279	41.639184	
101.000000	41.749478	41.790113	
41.929433	
54.000000	41.929433	41.987483	
52.000000	41.987483	42.091973	
52.000000	42.091973	42.132608	
112.000000	42.225488	42.324172	
56.000000	42.324172	42.573787	
50.000000	42.573787	42.672472	
46.000000	42.672472	42.707302	
46.000000	42.829206	43.020771	
50.000000	43.032381	43.334240	
57.000000	43.345850	43.485170	
63.000000	43.485170	43.520000	
52.000000	43.525805	43.665125	
55.000000	43.665125	43.670930	
78.000000	43.798639	43.827664	
56.000000	43.850884	43.932154	
56.000000	43.932154	44.019229	
52.000000	44.036644	44.193379	
51.000000	44.193379	44.413968	
45.008980	

What are these three columns of numbers? Thank you!

mepc36 commented May 24, 2022

I figured out the output format for aubio pitch: the left column is a timestamp in seconds, and the right column is the detected frequency in hertz. This can be verified with the example file from freesound here that plays middle C (about 261.6 Hz), which produces the following output:

(base) martin:example-objects martinconnor$ aubio pitch middle-c-freesound.mp3 
0.000000	760.443237
0.005805	753.561523
0.011610	750.607727
0.017415	374.043488
0.023220	248.188065
0.029025	248.194290
0.034830	248.653305
0.040635	248.997147
0.046440	249.103348
0.052245	249.177567
0.058050	249.168411
0.063855	248.922058
0.069660	248.481613
0.075465	248.609482
0.081270	255.844818
0.087075	259.823334
0.092880	259.865875
0.098685	260.903503
0.104490	262.111084
0.110295	262.855835
0.116100	262.939941
0.121905	262.992096
0.127710	263.292694
0.133515	262.827057
0.139320	261.837555
0.145125	261.382599
0.150930	261.370667
0.156735	260.950623
0.162540	260.443817
0.168345	260.724701
0.174150	261.839294
0.179955	263.056152
0.185760	263.492157
0.191565	262.836243
0.197370	261.910156
0.203175	261.561890
0.208980	261.730042
0.214785	262.104340
0.220590	262.396515
0.226395	262.412720
0.232200	262.217163
0.238005	262.020905
0.243810	262.037811
0.249615	262.124847
0.255420	262.118286
0.261224	261.967010
0.267029	261.629059
0.272834	261.470490
0.278639	261.925537
0.284444	262.805725
0.290249	263.237396
0.296054	262.746185
0.301859	262.000793
0.307664	261.624939
0.313469	261.568451
0.319274	261.767212
0.325079	262.137421
0.330884	262.338013
0.336689	262.149902
0.342494	261.839050
0.348299	261.677460
0.354104	261.757812
0.359909	262.039886
0.365714	262.227570
0.371519	262.142517
0.377324	261.905579
0.383129	261.746094
0.388934	261.771210
0.394739	261.854370
0.400544	261.836884
0.406349	261.793030
0.412154	261.752197
0.417959	261.656799
0.423764	261.701843
0.429569	262.027344
0.435374	262.303711
0.441179	262.295624
0.446984	262.100861
0.452789	261.856598
0.458594	261.681763
0.464399	261.678497
0.470204	261.928802
0.476009	262.204254
0.481814	262.071533
0.487619	261.548981
0.493424	261.272430
0.499229	261.533966
0.505034	261.861206
0.510839	262.019623
0.516644	262.158661
0.522449	262.054047
0.528254	261.652832
0.534059	261.422394
0.539864	261.532104
0.545669	261.764038
0.551474	262.059082
0.557279	262.127808
0.563084	261.588684
0.568889	260.861237
0.574694	260.551514
0.580499	260.864288
0.586304	261.581757
0.592109	262.051575
0.597914	261.676544
0.603719	261.112000
0.609524	260.718353
0.615329	260.644318
0.621134	261.223633
0.626939	262.130005
0.632744	262.481262
0.638549	261.933899
0.644354	260.984100
0.650159	260.220337
0.655964	261.339325
0.661769	262.028839
0.667574	262.192719
0.673379	261.274109
0.679184	260.386749
0.684989	260.212616
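
For anyone who wants to check this reading of the two columns, here is a minimal Python sketch. It assumes the output is tab- or space-separated text with a time in seconds followed by a frequency in Hz, and it treats a frequency of 0 as "no pitch reported", which is my assumption rather than something confirmed in this thread. The function names are made up for the example. The 0.005805 s step between rows is consistent with a hop of 256 samples at 44100 Hz, but both of those values are guesses on my part.

```python
import math


def parse_pitch_output(text):
    """Parse `aubio pitch` console output into (time_s, freq_hz) pairs."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 2:
            continue  # skip shell prompts, blank lines, anything unexpected
        try:
            rows.append((float(parts[0]), float(parts[1])))
        except ValueError:
            continue  # two fields, but not numeric
    return rows


def hz_to_midi(freq_hz, a4=440.0):
    """Convert a frequency in Hz to a fractional MIDI note number (A4 = 69)."""
    return 69.0 + 12.0 * math.log2(freq_hz / a4)


# Assumed hop size and sample rate; they only explain the row spacing.
HOP_SECONDS = 256 / 44100

# Three rows copied from the middle C output above.
sample = """\
0.394739\t261.854370
0.400544\t261.836884
0.406349\t261.793030
"""

rows = parse_pitch_output(sample)
for time_s, freq_hz in rows:
    if freq_hz > 0:  # assumption: 0 means no pitch was reported
        print(f"{time_s:.6f} s  {freq_hz:.2f} Hz  MIDI {hz_to_midi(freq_hz):.2f}")
```

The rows from the middle C file land very close to MIDI note 60, which is what you would expect if the right column really is a frequency in Hz.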

However, I'm still wondering about aubio notes.

mepc36 commented May 24, 2022

Having noticed that the values in the left-hand column of the aubio notes output fall in the range 0 to 127, I've figured out that they are MIDI values: each one is the MIDI note number (the pitch) of a detected note. The two right-hand columns are the note's start and end timestamps in seconds, which were hard to recognize at first because the intervals are so short.
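
Here is a small Python sketch for reading the aubio notes output, assuming the three columns are MIDI note number, start time, and end time in seconds. The helper names are made up for the example, and the note naming uses the common convention that MIDI 60 is C4 (middle C). Lines that contain only a single timestamp are skipped, since their meaning is not settled in this thread.

```python
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def midi_to_name(note):
    """Name a MIDI note number, using the convention that 60 is C4."""
    return f"{NOTE_NAMES[note % 12]}{note // 12 - 1}"


def midi_to_hz(note, a4=440.0):
    """Equal-tempered frequency of a MIDI note number (A4 = 69 = 440 Hz)."""
    return a4 * 2.0 ** ((note - 69) / 12.0)


def parse_notes_output(text):
    """Parse `aubio notes` console output into (midi_note, start_s, end_s)."""
    notes = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 3:
            continue  # skip prompts and single-timestamp lines
        try:
            note, start, end = (float(p) for p in parts)
        except ValueError:
            continue  # three fields, but not numeric
        notes.append((int(round(note)), start, end))
    return notes


# First three lines of the aubio notes output from the original post.
sample = """\
0.075465\t
70.000000\t0.075465\t0.116100\t
50.000000\t0.197370\t0.371519\t
"""

for note, start, end in parse_notes_output(sample):
    print(f"{midi_to_name(note):>3}  {midi_to_hz(note):8.2f} Hz  "
          f"{start:.3f} s to {end:.3f} s  ({end - start:.3f} s long)")
```

Comparing the frequencies printed here against the aubio pitch output for the same time range is one way to sanity-check that the first column is a pitch.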

I think this would be great info to add in the documentation somewhere!

mepc36 changed the title from "How to understand output of aubio pitch and aubio notes?" to "Missing documentation - how to understand output of aubio pitch and aubio notes?" on May 24, 2022