From https://www.w3.org/2022/03/24-webmachinelearning-minutes.html, it seems Ningxin got some pretty good results with "Integration of media capture transform". Maybe we should explore this as well.