Lipsync:
Creating a Unity script for Face, Blink, and Viseme morphs to use with Daz3D models.
Face blend shapes (happy, sad/lost, angry, shock), plus four others through combination.
Blink blend shapes. Auto-blink with ranged values. Min, max, stop after open/close (stare/sleep), blink once now, squint.
Viseme blend shapes. Emphasis modifier.
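A rough sketch of the auto-blink timing described above, written in Python for brevity rather than as the actual Unity script. The class name, parameter names, default values, and the triangular open/close curve are all assumptions made for illustration. Only the ranged interval, the stop option, and "blink once now" come from the notes; squint and sleep (stop while closed) are not covered.

```python
import random


class AutoBlink:
    """Hypothetical auto-blink timer; returns an eyelid-closed weight in 0..1."""

    def __init__(self, min_interval=2.0, max_interval=6.0,
                 blink_duration=0.15, enabled=True, seed=None):
        # All defaults are assumed values in seconds, not taken from the project.
        self.min_interval = min_interval
        self.max_interval = max_interval
        self.blink_duration = blink_duration
        self.enabled = enabled            # False = stop auto-blinking (stare)
        self._rng = random.Random(seed)
        self._blink_elapsed = -1.0        # negative means "not blinking"
        self._time_to_next = self._next_interval()

    def _next_interval(self):
        # Ranged value: wait a random time between min and max.
        return self._rng.uniform(self.min_interval, self.max_interval)

    def blink_now(self):
        """Start a single blink immediately, even if auto-blink is stopped."""
        self._blink_elapsed = 0.0

    def update(self, dt):
        """Advance by dt seconds and return the current blink weight."""
        if self._blink_elapsed >= 0.0:
            self._blink_elapsed += dt
            if self._blink_elapsed >= self.blink_duration:
                # Blink finished: eyes open, schedule the next one.
                self._blink_elapsed = -1.0
                self._time_to_next = self._next_interval()
                return 0.0
            phase = self._blink_elapsed / self.blink_duration
            return 1.0 - abs(2.0 * phase - 1.0)   # triangle: 0 -> 1 -> 0
        if self.enabled:
            self._time_to_next -= dt
            if self._time_to_next <= 0.0:
                self.blink_now()
        return 0.0
```

In an engine, update would be called once per frame with the frame delta, and the returned weight scaled to whatever range the blink blend shape expects.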
Speech Recognition:
Added Visemes to Speech Recognition engine.
Added/Updated Word Hashing and Word Search.
Working: Viseme lipsync "no" x 3, "yes" x 3. (simulated from speech recognition data)
In these tests the lipsync already looks as good as, if not better than, current SOTA lip syncing.
The default viseme blend shapes from Daz3D work well. However, each is a single frame, and some phonemes need two (start and end). For "oh", the "ah" and "w" shapes are used.
Both the timing and the two-frame "oh" in "no" boost quality by quite a lot.
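The two-frame idea can be sketched as a small keyframe builder, again in Python rather than Unity code. Everything here is hypothetical: the function and table names, the base weight of 0.7, placing the second frame at the end of the segment, and treating the emphasis modifier as a simple multiplier on the weight. Using "ah" as the start frame and "w" as the end frame follows the order in which the notes list them, which is also an assumption.

```python
# Hypothetical table: visemes that need two frames -> (start shape, end shape).
TWO_FRAME = {"oh": ("ah", "w")}


def viseme_keyframes(segments, emphasis=1.0, base_weight=0.7):
    """Turn timed viseme segments into blend shape keyframes.

    segments: list of (viseme, start_time, end_time), times in seconds.
    Returns a list of (time, blend_shape, weight) with weight clamped to 0..1.
    """
    weight = max(0.0, min(1.0, base_weight * emphasis))
    keys = []
    for viseme, start, end in segments:
        if viseme in TWO_FRAME:
            first, second = TWO_FRAME[viseme]
            keys.append((start, first, weight))   # start frame of the phoneme
            keys.append((end, second, weight))    # end frame of the phoneme
        else:
            keys.append((start, viseme, weight))  # single-frame viseme
    return keys


# "no" as an "n" followed by a two-frame "oh" (timings are made up).
no_keys = viseme_keyframes([("n", 0.0, 0.1), ("oh", 0.1, 0.4)])
```

The segment times would come from the speech recognition data; the values in the example are placeholders.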
The audio used is a few samples from the Google Speech Commands dataset: "no" x 3 and "yes" x 3.