Choosing a Model

    Acoustic Models

    For docs on how to specify the Acoustic Model when making your request, see the guide here.

    An Acoustic Model learns audio characteristics such as accent, background noise, recording quality, and recording environment. We offer several Acoustic Models, each fine-tuned for a different type of data, and are always launching more.

    Model | Use Case | Model Parameter Name
    Default | Best for all types of data and accents (English only) | "assemblyai_default"
    Australian English | Tailored for Australian accented English | "assemblyai_en_au"
    South African English | Tailored for South African accented English | "assemblyai_en_za"
    UK English | Tailored for UK accented English | Coming Soon
    Indian English | Tailored for Indian accented English | Coming Soon
    South East Asian English | Tailored for South East Asian accented English | Coming Soon
    Custom | Tailored for your data. You can expect a large improvement over the default model on your data. | Contact us for more info: support@assemblyai.com
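
    As a rough illustration of choosing among the models listed above, the sketch below maps an accent key to a Model Parameter Name and falls back to the default model when no tailored model has been released yet. The accent keys ("en_au", "en_za") and the helper name pick_acoustic_model are hypothetical and not part of the API; only the parameter-name strings come from the table.

```python
# Model Parameter Names copied from the Acoustic Models table.
# Models marked "Coming Soon" have no parameter name yet, so they are omitted.
# ASSUMPTION: the dictionary keys are illustrative labels, not API values.
ACOUSTIC_MODELS = {
    "default": "assemblyai_default",
    "en_au": "assemblyai_en_au",
    "en_za": "assemblyai_en_za",
}


def pick_acoustic_model(accent: str) -> str:
    """Return the parameter name for an accent key.

    Falls back to the default model when no tailored model is available.
    """
    return ACOUSTIC_MODELS.get(accent.lower(), ACOUSTIC_MODELS["default"])


if __name__ == "__main__":
    print(pick_acoustic_model("en_au"))
    print(pick_acoustic_model("en_gb"))  # UK English is "Coming Soon", so this falls back
```

    Falling back to the default is one reasonable choice here, since the default model is described as suitable for all English accents; raising an error for unknown keys would be an equally valid design.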

    Language Models

    For docs on how to specify the Language Model when making your request, see the guide here.

    The Language Model helps disambiguate between probable words and phrases. For example, did you say "two", "to", "too", or the custom word "tooh"?

    Model | Use Case | Model Parameter Name
    Default | Best for conversational data like phone calls and meetings. | "assemblyai_default"
    Media | Best for broadcast news and radio, and for recognizing proper nouns. This model has a much larger vocabulary than the default model. | "assemblyai_media"
    Custom | You can also use the API to train a custom Language Model. | -
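
    To show how an Acoustic Model and a Language Model might be selected together, here is a minimal sketch that checks both names against the tables above and builds a JSON request body. The field names "audio_url", "acoustic_model", and "language_model", and the helper build_request_body, are assumptions made for illustration; the guides referenced above are the authoritative source for the actual request format.

```python
import json

# Parameter names listed in the two tables. Custom model names are not
# given in this document and would need to be added here.
ACOUSTIC_MODELS = {"assemblyai_default", "assemblyai_en_au", "assemblyai_en_za"}
LANGUAGE_MODELS = {"assemblyai_default", "assemblyai_media"}


def build_request_body(audio_url,
                       acoustic_model="assemblyai_default",
                       language_model="assemblyai_default"):
    """Return a JSON string naming the audio file and the two models to use."""
    if acoustic_model not in ACOUSTIC_MODELS:
        raise ValueError(f"unknown acoustic model: {acoustic_model!r}")
    if language_model not in LANGUAGE_MODELS:
        raise ValueError(f"unknown language model: {language_model!r}")
    # ASSUMPTION: these three field names are illustrative only.
    return json.dumps({
        "audio_url": audio_url,
        "acoustic_model": acoustic_model,
        "language_model": language_model,
    })


if __name__ == "__main__":
    # A radio broadcast with Australian speakers (hypothetical URL).
    print(build_request_body(
        "https://example.com/broadcast.wav",
        acoustic_model="assemblyai_en_au",
        language_model="assemblyai_media",
    ))
```

    Validating the names locally catches typos before a request is sent; note that "assemblyai_default" is a valid value for both model types.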