Creates a new VisemeElement instance.
The type of viseme animation data to generate.
Common values include:
- redlips_front: Front-facing lip animation data for standard avatars
- redlips_back: Back-facing lip animation data for alternative views
The type determines the format and characteristics of the viseme events
that will be generated during speech synthesis. Different types may
provide different levels of detail or be optimized for specific
animation systems or avatar types.
// Standard front-facing viseme for avatars
const frontViseme = new VisemeElement('redlips_front');
// Back-facing viseme for special camera angles
const backViseme = new VisemeElement('redlips_back');
// For use with animation systems
const animationViseme = new VisemeElement('redlips_front');
// The generated viseme events can be used with:
// - Unity 3D avatars
// - Unreal Engine characters
// - Web-based 3D animations (Three.js, Babylon.js)
// - Ready Player Me avatars [[1]](https://github.com/met4citizen/TalkingHead)
escapeXml
Protected
Escapes special XML characters in text content to ensure valid XML output.
This method replaces XML special characters with their corresponding entity references to prevent XML parsing errors and potential security issues (XML injection). It should be used whenever inserting user-provided or dynamic text content into XML elements.
The following characters are escaped:
- & becomes &amp; (must be escaped first to avoid double-escaping)
- < becomes &lt; (prevents opening of unintended tags)
- > becomes &gt; (prevents closing of unintended tags)
- " becomes &quot; (prevents breaking out of attribute values)
- ' becomes &apos; (prevents breaking out of attribute values)

This method is marked as protected, so it is only accessible to classes that extend SSMLElement, ensuring proper encapsulation while allowing all element implementations to use this essential functionality.
The text content to escape
The text with all special XML characters properly escaped
// In a render method implementation
class TextElement extends SSMLElement {
  private text: string = 'Hello & "world" <script>';

  render(): string {
    // Escapes to: Hello &amp; &quot;world&quot; &lt;script&gt;
    return `<text>${this.escapeXml(this.text)}</text>`;
  }
}

// Edge cases handled correctly
this.escapeXml('5 < 10 & 10 > 5');
// Returns: '5 &lt; 10 &amp; 10 &gt; 5'
this.escapeXml('She said "Hello"');
// Returns: 'She said &quot;Hello&quot;'
this.escapeXml("It's a test");
// Returns: 'It&apos;s a test'

// Prevents XML injection
this.escapeXml('</voice><voice name="evil">');
// Returns: '&lt;/voice&gt;&lt;voice name=&quot;evil&quot;&gt;'
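The escaping behavior shown above can be sketched as a standalone function. This is an illustrative version only; the actual implementation is a protected method on SSMLElement and may differ internally:

```typescript
// Illustrative sketch of the documented escaping logic; not the actual
// SSMLElement implementation.
function escapeXml(text: string): string {
  return text
    .replace(/&/g, '&amp;')   // ampersand first, to avoid double-escaping
    .replace(/</g, '&lt;')    // prevents opening unintended tags
    .replace(/>/g, '&gt;')    // prevents closing unintended tags
    .replace(/"/g, '&quot;')  // protects double-quoted attribute values
    .replace(/'/g, '&apos;'); // protects single-quoted attribute values
}
```

Performing the ampersand replacement first is essential: if `<` were replaced before `&`, the `&` inside the freshly inserted `&lt;` would itself be escaped, producing `&amp;lt;`.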
Renders the viseme element as an SSML XML string.
Generates the Azure-specific <mstts:viseme>
element with the type attribute
specifying what kind of viseme data should be generated. This is a self-closing
element that doesn't contain any content. When processed by the speech synthesizer,
it enables the generation of viseme events that can be captured through the
Speech SDK for driving facial animations.
The XML string representation of the viseme element in the format:
<mstts:viseme type="type"/>
// Standard front-facing viseme
const front = new VisemeElement('redlips_front');
console.log(front.render());
// Output: <mstts:viseme type="redlips_front"/>
// Back-facing viseme
const back = new VisemeElement('redlips_back');
console.log(back.render());
// Output: <mstts:viseme type="redlips_back"/>
// Custom viseme type (if supported by service)
const custom = new VisemeElement('custom_avatar_type');
console.log(custom.render());
// Output: <mstts:viseme type="custom_avatar_type"/>
SSML element for generating viseme events for lip-sync animation (Azure-specific).
The <mstts:viseme> element enables the generation of viseme (visual phoneme) events during speech synthesis. Visemes represent the visual positions of the mouth, lips, and face that correspond to spoken phonemes. This Azure Speech Service-specific feature is essential for creating realistic lip-sync animations for avatars, animated characters, or virtual assistants.

When this element is included, the speech synthesizer generates time-aligned viseme data that can be used by 3D rendering engines or animation systems to synchronize facial movements with the audio output. Each viseme event includes timing information and blend shape data that defines how the face should be positioned at that moment in the speech.
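Putting the documented behavior together, a minimal sketch of how VisemeElement might be structured. The base class shape and member names here are assumptions drawn from this documentation, not the actual source:

```typescript
// Hypothetical sketch based on the documented API; the real SSMLElement
// base class has additional members (e.g. the protected escapeXml method).
abstract class SSMLElement {
  abstract render(): string;
}

class VisemeElement extends SSMLElement {
  // type selects the viseme data format, e.g. 'redlips_front' or 'redlips_back'
  constructor(private readonly type: string) {
    super();
  }

  // Renders the self-closing Azure-specific element; it carries no content,
  // only the type attribute.
  render(): string {
    return `<mstts:viseme type="${this.type}"/>`;
  }
}
```

A usage example, matching the render output documented above:

```typescript
const front = new VisemeElement('redlips_front');
console.log(front.render()); // <mstts:viseme type="redlips_front"/>
```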