Spectral Analysis Visualization
Spectrogram
Spectrogram: Frequency over time with energy intensity.
Waveform
Audio Properties (metadata)
Properties (metadata) | |||||
Property/Metric | Value | Threshold | Score | Source | Formula - info |
File Name | - | - | - | - | \( \text{Size:\ MB,\ GB} \) - |
File Size | - | - | - | - | \( \text{Size:\ MB,\ GB} \) - |
MIME Type | - | - | - | - | \( \text{MIME:\ Type} \) - |
Extension | - | - | - | - | \( \text{.ext} \) - |
Encoding | - | - | - | - | \( \text{String:\ UTF-8,\ etc.} \) - |
Title | - | - | - | - | \( \text{String:\ Title} \) - |
Artist | - | - | - | - | \( \text{String:\ Artist} \) - |
Album | - | - | - | - | \( \text{String:\ Album} \) - |
Genre | - | - | - | - | \( \text{String:\ Genre} \) - |
Year | - | - | - | - | \( \text{Year:\ YYYY} \) - |
Track Number | - | - | - | - | \( \text{Track\ \#:\ N} \) - |
Composer | - | - | - | - | \( \text{String:\ Composer} \) - |
Lyrics | - | - | - | - | \( \text{Lyrics:\ String} \) - |
Publisher | - | - | - | - | \( \text{String:\ Publisher} \) - |
Disc Number | - | - | - | - | \( \text{Disc\ \#:\ N} \) - |
Encoder | - | - | - | - | \( \text{String:\ Encoder} \) - |
Language | - | - | - | - | \( \text{Language:\ ISO\ 639-1} \) - |
Copyright | - | - | - | - | \( \text{Legal:\ \copyright} \) - |
ISRC | - | - | - | - | \( \text{ISRC:\ Alphanumeric\ ID} \) - |
BPM | - | - | - | - | \( \text{BPM:\ Integer} \) - |
Album Artist | - | - | - | - | \( \text{String:\ Album\ Artist} \) - |
Key | - | - | - | - | \( \text{Key:\ A,\ B,\ etc.} \) - |
Initial Key | - | - | - | - | \( \text{Key:\ A,\ B,\ etc.} \) - |
Length | - | - | - | - | \( \text{Length:\ HH:MM:SS} \) - |
Comments | - | - | - | - | \( \text{Comments:\ String} \) |
Spectral Features | |||||
Property/Metric | Value | Threshold | Score | Source | Formula - info |
Spectral Centroid | - | High: > 4000 Hz | - | - | \( C = \frac{\sum_{k} f_k A_k}{\sum_{k} A_k} \) |
Spectral Flatness | - | High: 0.8–1.0 | - | - | \( SF = \frac{(\sum_{k} A_k)^2}{\sum_{k} A_k^2} \) |
Spectral Bandwidth | - | Medium: 500–1500 Hz | - | - | \( BW = \sqrt{\frac{\sum_{k} (f_k - C)^2 A_k}{\sum_{k} A_k}} \) |
Spectral Roll-off | - | High: > 4500 Hz | - | - | \( SR = \frac{\sum_{k} A_k \cdot \mathbb{1}(f_k \geq f_{\text{roll}})}{\sum_{k} A_k} \) |
Spectral Contrast | - | High: > 30 dB | - | - | \( SC = \sum_{i} \left| E_{i} - E_{i+1} \right| \) |
Spectral Energy | - | High: > 500 | - | - | \( E = \sum_{k} A_k^2 \) |
Spectral Flux | - | High: > 0.5 | - | - | \( SF = \sum_{k} (A_k^{(n)} - A_k^{(n-1)})^2 \) |
Spectral Entropy | - | Low: < 0.5 | - | - | \( H = - \sum_{k} P_k \log P_k \) |
Temporal Features | |||||
Property/Metric | Value | Threshold | Score | Source | Formula - info |
RMS | - | Medium: 0.3–0.5 | - | - | \( RMS = \sqrt{\frac{1}{N} \sum_{n=1}^N x_n^2} \) |
Peak Amplitude | - | High: > 1.0 | - | - | \( A_{peak} = \max(|x_n|) \) |
Zero Crossing Rate | - | Low: < 10% | - | - | \( ZCR = \sum_{n=1}^{N-1} \mathbb{1}(x_n x_{n+1} < 0) \) |
Low Energy Ratio | - | Medium: 10–30% | - | - | \( LER = \frac{\sum_{i \in \text{Low}} E_i}{\sum_{i} E_i} \) |
Attack Time | - | Short: < 50 ms | - | - | \( T_{attack} = \frac{t_{90} - t_{10}}{2} \) |
Decay Time | - | Medium: 50–200 ms | - | - | \( T_{decay} = t_{90} - t_{10} \) |
Sustain Level | - | High: > 70% | - | - | \( S_{level} = \frac{\sum_{t \in \text{sustain}} A_t}{\sum_{t \in \text{full}} A_t} \) |
Release Time | - | Long: > 200 ms | - | - | \( T_{release} = t_0 - t_f \) |
Temporal Centroid | - | Medium: ~50% | - | - | \( TC = \frac{\sum_{t} t \cdot A_t}{\sum_{t} A_t} \) |
Metric | |||||
Property/Metric | Value | Threshold | Score | Source | Formula - info |
Frequency Peaks | - | Medium: 1–3 peaks | - | - | Peaks \( = \text{argmax}(X(f)) \) |
Octave Bands | - | Medium: 70–85 dB | - | - | \( \text{OctB} = 10 \log_{10} \left(\sum_{f \in \text{band}} |X(f)|^2 \right) \) |
Harmonic Ratio | - | High: > 90% | - | - | \( \text{HR} = \frac{\text{Energy}_{\text{Harmonic}}}{\text{Total Energy}} \) |
Dominant Frequency | - | High: > 1000 Hz | - | - | \( f_{\text{Dominant}} = \text{argmax}(|X(f)|) \) |
Fundamental Frequency | - | High: > 300 Hz | - | - | \( f_0 = \frac{1}{T} \) |
Harmonic Distortion | - | Low: < 10% | - | - | \( \text{THD} = \frac{\sum_{n=2}^N |H_n|^2}{|H_1|^2} \) |
Spectral Slope | - | Medium: -6 dB/octave | - | - | \( S = \frac{\Delta A}{\Delta f} \) |
Inharmonicity | - | Low: < 0.1 | - | - | \( I = \frac{\sum_{n=1}^N (f_n - nf_0)^2}{\sum_{n=1}^N f_n^2} \) |
Energy per Octave | - | High: > 50% | - | - | \( E_{\text{Octave}} = \frac{\text{Energy in Octave}}{\text{Total Energy}} \) |
Spectral Centroid | - | High: > 4000 Hz | - | - | \( C = \frac{\sum f |X(f)|}{\sum |X(f)|} \) |
Spectral Flatness | - | High: 0.8–1.0 | - | - | \( \text{Flatness} = \frac{\text{Geometric Mean}(|X(f)|)}{\text{Arithmetic Mean}(|X(f)|)} \) |
Spectral Kurtosis | - | Low: < 3 | - | - | \( K = \frac{\mu_4}{\sigma^4} \) |
Spectral Skewness | - | Low: < 0.5 | - | - | \( \text{Skewness} = \frac{\mu_3}{\sigma^3} \) |
Noise-to-Harmonic Ratio | - | Low: < 0.2 | - | - | \( \text{NHR} = \frac{\text{Energy}_{\text{Noise}}}{\text{Energy}_{\text{Harmonic}}} \) |
Energy Metrics | |||||
---|---|---|---|---|---|
Property/Metric | Value | Threshold | Formula | Source | Formula - Info |
Total Energy | - | Medium: 100–200 J | - | - | \( E_{\text{total}} = \int P(t) \, dt \) |
Mean Energy | - | Medium: 2–4 J | - | - | \( E_{\text{mean}} = \frac{1}{T} \int_{0}^{T} P(t) \, dt \) |
Energy Distribution (Bass) | - | Medium: 20–40% | - | - | \( \frac{E_{\text{bass}}}{E_{\text{total}}} \times 100 \) |
Energy Distribution (Treble) | - | Medium: 30–50% | - | - | \( \frac{E_{\text{treble}}}{E_{\text{total}}} \times 100 \) |
Dynamic Range | - | High: > 20 dB | - | - | \( \text{DR} = 20 \log_{10} \frac{P_{\text{max}}}{P_{\text{min}}} \) |
Crest Factor | - | High: > 3 | - | - | \( \text{CF} = \frac{P_{\text{peak}}}{P_{\text{rms}}} \) |
Energy Envelope Variance | - | Low: < 0.1 | - | - | \( \sigma^2 = \frac{1}{N} \sum_{i=1}^N (E_i - \mu)^2 \) |
Normalized Energy | - | High: > 0.8 | - | - | \( E_{\text{norm}} = \frac{E_{\text{current}}}{E_{\text{max}}} \) |
Transient Energy | - | Medium: 10–30% | - | - | \( \frac{E_{\text{transient}}}{E_{\text{total}}} \times 100 \) |
Sustain Energy | - | Medium: 30–60% | - | - | \( \frac{E_{\text{sustain}}}{E_{\text{total}}} \times 100 \) |
Peak-to-Average Ratio | - | High: > 3 dB | - | - | \( \text{PAR} = \frac{P_{\text{peak}}}{P_{\text{average}}} \) |
Custom Metrics | |||||
Property/Metric | Value | Threshold | Formula | Source | Formula - Info |
Copyright | - | - | - | - | \( \text{Legal\ Protection\ \copyright} \) - |
Publisher | - | - | - | - | \( \text{Publishing\ Entity} \) - |
Language | - | - | - | - | \( \text{Language:\ ISO\ Code} \) - |
ISRC | - | - | - | - | \( \text{ISRC:\ A12345AB6789} \) - |
UPC | - | - | - | - | \( \text{UPC:\ 012345678901} \) - |
Geolocation | - | - | - | - | \( \text{Coordinates:\ (lat, long)} \) - |
Region of Recording | - | - | - | - | \( \text{Region:\ Country\ Name} \) - |
Creation Date | - | - | - | - | \( \text{Date:\ YYYY-MM-DD} \) - |
Modification Date | - | - | - | - | \( \text{Date:\ YYYY-MM-DD} \) - |
License Type | - | - | - | - | \( \text{License:\ CC-BY-SA} \) - |
Creative Commons Information | - | - | - | - | \( \text{CC\ Info:\ CC0} \) - |
Album Art URL | - | - | - | - | \( \text{Image:\ URL} \) - |
Original Source URL | - | - | - | - | \( \text{URL:\ https://example.com} \) - |
Genre Subcategory | - | - | - | - | \( \text{Subcategory:\ Pop-Rock} \) - |
Explicit Content | - | - | - | - | \( \text{Explicit:\ Yes/No} \) - |
Keywords | - | - | - | - | \( \text{Keywords:\ Tag1,Tag2,...} \) - |
Geotag (Latitude, Longitude) | - | - | - | - | \( \text{Geo:\ (lat,long)} \) - |
Recording Environment | - | - | - | - | \( \text{Environment:\ Indoor/Outdoor} \) - |
Audio Watermark | - | - | - | - | \( \text{Watermark:\ Present/Absent} \) - |
Search for / Drag & Drop Your Files Here
Or click the button below to select files.
Accepted formats: MP3, WAV, FLAC, AAC, OGG, M4A, WMA, AIFF, ALAC, etc.