You can use this block:
Set the thumbnail to a voice message image.

You can probably use this block to detect that a message has been clicked:
Then add a simple if condition: if the clicked message has a file, set the Player component's source to that file (you can swap in ExoPlayer or any other extension).
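
In case it helps to see the logic written out, here is a rough sketch of that if condition in Python rather than blocks. The names `Message`, `Player` and `on_message_click` are made up for illustration and are not real component or extension APIs; `Player` just stands in for whatever player component you use.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Message:
    """Hypothetical chat message; `file` is the voice recording path, if any."""
    text: str
    file: Optional[str] = None


class Player:
    """Stand-in for the Player component (or ExoPlayer, or any extension)."""

    def __init__(self) -> None:
        self.source: Optional[str] = None


def on_message_click(message: Message, player: Player) -> bool:
    """Handle a click on a message.

    Returns True if the message had a file and the player source was set,
    False if it was a plain text message and nothing changed.
    """
    if message.file:  # the "simple if condition": does the message have a file?
        player.source = message.file
        return True
    return False
```

Usage would look something like this: clicking a text message leaves the player alone, clicking a voice message points the player at its file. Starting playback is a separate step and is left out here.

```python
player = Player()
on_message_click(Message("hello"), player)                    # no file, source unchanged
on_message_click(Message("", file="voice_001.mp3"), player)   # source is now the voice file
```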