The Bitext Synthetic Data consist of pre-built training data for intent detection and are provided for 20 verticals for English language (see ELRA-L0162 to ELRA-L0181). They cover the most common intents for each vertical and include a large number of example utterances for each intent, with optional entity/slot annotations for each utterance. The Manufacturing domain comprises 34 intents for English. Data is distributed as models or open text files.