Extract an audio artifact from a step output.
Step output value.
First audio-like artifact.
When no object-like artifact is present.
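
The behaviour described above could be sketched roughly as follows. This is a minimal illustration, not the actual implementation: the function name `extract_audio_artifact`, the `AUDIO_KEYS` marker keys, the use of `ValueError`, and the fallback to the first object when nothing looks audio-like are all assumptions that the original text does not specify.

```python
from collections.abc import Mapping
from typing import Any

# Hypothetical marker keys: the source does not say how "audio-like" is decided.
AUDIO_KEYS = ("audio", "audio_url", "waveform")


def extract_audio_artifact(output: Any) -> Mapping:
    """Return the first audio-like artifact in a step output (illustrative sketch)."""
    # Normalize the step output into a list of candidate artifacts.
    if isinstance(output, Mapping):
        candidates = [output]
    elif isinstance(output, (list, tuple)):
        candidates = list(output)
    else:
        candidates = []

    # Keep only object-like (mapping) artifacts.
    objects = [c for c in candidates if isinstance(c, Mapping)]
    if not objects:
        # Assumed exception type: the source only says this case is an error.
        raise ValueError("no object-like artifact in step output")

    # Prefer the first artifact that carries one of the assumed audio keys.
    for obj in objects:
        if any(key in obj for key in AUDIO_KEYS):
            return obj

    # Assumed fallback: return the first object-like artifact.
    return objects[0]
```

Under these assumptions, `extract_audio_artifact([{"text": "hi"}, {"audio": b"x"}])` returns the second item, while a bare string raises `ValueError`.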