In the field of audio, blind tests show what a listener is actually able to hear. Lots of people will tell you "Oh yes, the difference is huge". In this edition of our online ABX tests, you can try your ear (and your equipment) at distinguishing Spotify's high-quality streaming from lossless audio. It does this using an ABX test, which uses the same encoding as the Tidal quiz but asks you to match a reference clip (X) to one of two clips (A or B).

In this test, you will be presented with three samples: A, B and X. A and B stay fixed across trials: one is lossless and one is lossy. Each trial, X is randomly set to either A or B, and you have to work out which one it is. Once the track is playing, you can switch between the samples by pressing A, B, or X, and seek through the track using the -5s, << (rewind), and +5s buttons. Then enter your choice by pressing the Next button. All buttons also have hotkeys. Your progress through trials and tracks is shown at the bottom.

A p-value is the probability that the result you got, or one more extreme (i.e. further from 50% correct), would occur by chance if you were completely unable to detect a difference. Although 5 trials per track is sufficient to estimate whether you can tell the difference between lossy and lossless overall, working out which individual tracks you can tell the difference on requires 20 trials per sample.

Because the likelihood of getting one or more extreme results by chance increases with the number of calculations, Bonferroni correction is applied to compensate. If you do multiple calculations (one per track) and each has a 5% chance of being wrong, then doing 5 calculations leads to a chance of up to 25% (about 23% if the calculations are independent) that at least one is wrong. Bonferroni correction is done by dividing the cut-off by the number of calculations (i.e. for 2 samples the cut-off becomes 0.05/2, so the new cut-off is p<0.025).

I have no way to directly test Spotify vs Tidal in a manner where the two streaming files can be properly volume matched and tested with an ABX tool.
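The multiple-comparison arithmetic above can be checked with a few lines of Python. This is only a sketch: the function names are made up for illustration, and the exact figure assumes the per-track calculations are independent, which the text does not state. The 25% figure is the simple upper bound 5 x 5%; under independence the chance of at least one false positive is slightly lower.

```python
def family_wise_error(alpha: float, n_tests: int) -> float:
    """Chance of at least one false positive across n_tests calculations.

    Assumes the calculations are independent (an assumption made here
    for illustration, not something the test page states).
    """
    return 1.0 - (1.0 - alpha) ** n_tests


def bonferroni_cutoff(alpha: float, n_tests: int) -> float:
    """Bonferroni correction: divide the cut-off by the number of calculations."""
    return alpha / n_tests


if __name__ == "__main__":
    alpha = 0.05
    for n in (1, 2, 5):
        cutoff = bonferroni_cutoff(alpha, n)
        print(
            f"{n} test(s): uncorrected risk {family_wise_error(alpha, n):.3f}, "
            f"corrected cut-off {cutoff:.3f}, "
            f"corrected risk {family_wise_error(cutoff, n):.3f}"
        )
```

With 5 calculations the uncorrected risk comes out at about 0.226, and dividing the cut-off by 5 brings the overall risk back to just under 0.05.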
An ABX listening test takes two audio samples (A and B) and provides a method for determining whether they are distinguishable to a listener. More generally, ABX is a method of comparing two choices of sensory stimuli to identify detectable differences between them. You will be presented with two reference samples (A and B) and a target sample (X). When the comparison can be reduced to audio files, ABX testing can be automated with something like the ABX Comparator plugin for foobar2000, meaning a single person can test themselves.

The Tidal test had you try to identify which of two versions of a track was lossless for each of five tracks. Here, you will be administered multiple trials for each of the five tracks used in the Tidal test.

Your result is reported as a percentage. This percentage is the likelihood that the result obtained, or one more extreme (i.e. further from 50% correct), would occur by chance if you were completely unable to detect a difference. For example, if you get 18 of 25 correct, then the percentage will be 3.2%. To say with any confidence that you can hear a difference, this percentage typically needs to be less than 5%. Bonferroni correction is done by dividing the cut-off by the number of calculations: 5 tests = a cut-off of 1%.

There are many answers right here using words like "massive" and "crisper" and "depth". The vast majority of people cannot pass a volume-matched, blind ABX test comparing the source lossless file and even a 128kbps AAC version of the same file. The TIDAL test was shown to be tampered with. I compared Spotify against Tidal (lossless) for a couple of months. I can definitely hear the difference between 96/160/256 kbps. Spotify is, for sure (IMHO), good enough if you are using Bluetooth headphones.
In most cases (excluding things like LDAC) good headphones are not there.
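The percentage reported for an ABX result can be reproduced, at least approximately, with an exact binomial calculation. The sketch below assumes a two-sided test against pure guessing (a 50% chance per trial), which is how "further from 50% correct" reads; the function name `abx_p_value` is hypothetical and is not taken from the test page. Under these assumptions, 18 of 25 correct gives roughly 4.3% rather than the 3.2% quoted above, so the page's calculator presumably uses a different method.

```python
from math import comb


def abx_p_value(correct: int, trials: int) -> float:
    """Exact two-sided binomial p-value under pure guessing (p = 0.5).

    Assumption: "more extreme" means at least as far from 50% correct
    in either direction, so both tails are counted.
    """
    if trials <= 0 or not 0 <= correct <= trials:
        raise ValueError("need 0 <= correct <= trials and trials > 0")
    # Binomial(n, 0.5) is symmetric, so fold the score onto the upper tail.
    k = max(correct, trials - correct)
    upper_tail = sum(comb(trials, i) for i in range(k, trials + 1)) / 2 ** trials
    # Double the tail for a two-sided test, capped at 1.
    return min(1.0, 2.0 * upper_tail)


if __name__ == "__main__":
    for correct, trials in [(18, 25), (5, 5), (16, 20), (17, 20)]:
        print(f"{correct}/{trials}: p = {abx_p_value(correct, trials):.4f}")
```

Under the same assumptions, a perfect 5 of 5 only reaches 6.25%, and with 20 trials on one track a 1% cut-off would need at least 17 correct (16 of 20 gives about 1.2%).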

