Question 1/21
What type of task is viewed when using the Seq2Seq model in speech recognition?
Correct Answer: D
The Seq2Seq (sequence-to-sequence) model converts an input sequence into an output sequence. In speech recognition, the input is a sequence of acoustic features, and the output is a sequence of text tokens. This is essentially a classification task because each output token is classified into a predefined vocabulary set.
Although the output is sequential, each position in the output sequence involves a classification decision.
Exact Extract from HCIP-AI EI Developer V2.5:
"In speech recognition, Seq2Seq models classify each output token from a fixed vocabulary, making the overall problem a sequence of classification tasks."
Reference: HCIP-AI EI Developer V2.5 Official Study Guide - Chapter: Sequence Models in Speech Recognition
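To make the "one classification decision per output position" idea concrete, here is a minimal sketch in plain Python. It is not taken from the study guide: the vocabulary `VOCAB`, the helper names `softmax` and `classify_steps`, and the score values in `toy_logits` are all made up for illustration. It also assumes the decoder scores are already available for every step, whereas a real Seq2Seq decoder computes each step's scores conditioned on the tokens it has already emitted.

```python
import math

# Hypothetical toy vocabulary; a real system would use thousands of tokens.
VOCAB = ["<eos>", "h", "i", "o"]


def softmax(logits):
    """Turn raw scores into a probability distribution over the vocabulary."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_steps(step_logits):
    """Pick the most probable vocabulary entry at each output position."""
    tokens = []
    for logits in step_logits:
        probs = softmax(logits)
        best = max(range(len(probs)), key=probs.__getitem__)
        if VOCAB[best] == "<eos>":  # end-of-sequence class stops decoding
            break
        tokens.append(VOCAB[best])
    return tokens


# Made-up decoder scores, one row per output position (assumption).
toy_logits = [
    [0.1, 2.0, 0.3, 0.2],  # highest score: "h"
    [0.0, 0.1, 3.0, 0.5],  # highest score: "i"
    [4.0, 0.2, 0.1, 0.3],  # highest score: "<eos>"
]

print(classify_steps(toy_logits))  # ['h', 'i']
```

Each row is an ordinary multi-class classification over the same fixed vocabulary, which is the sense in which the explanation above treats the overall task as classification.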
- Other Questions (21)
- Q1. What type of task is viewed when using the Seq2Seq model in speech recognition?...
- Q2. Transformer models outperform LSTM when analyzing and processing long-distance dependencie...
- Q3. The accuracy of object location detection can be evaluated using the intersection over uni...
- Q4. Which of the following methods are useful when tackling overfitting?...
- Q5. Which of the following statements about the functions of layer normalization and residual ...
- Q6. Maximum likelihood estimation (MLE) can be used for parameter estimation in a Gaussian mix...
- Q7. Mel-frequency cepstral coefficients (MFCCs) take into account human auditory characteristi...
- Q8. Which of the following is not an algorithm for training word vectors?...
- Q9. Which of the following are the impacts of the development of large models?...
- Q10. Which of the following has never been used as a method in the history of NLP?...
- Q11. The U-Net uses an upsampling mechanism and has a fully-connected layer....
- Q12. Huawei Cloud ModelArts is a one-stop AI development platform that supports multiple AI sce...
- Q13. Which of the following statements are true about the differences between using convolution...
- Q14. Which of the following statements about the standard normal distribution are true?...
- Q15. The natural language processing field usually uses distributed semantic representation to ...
- Q16. In the image recognition algorithm, the structure design of the convolutional layer has a ...
- Q17. How many parameters need to be learned when a 3 × 3 convolution kernel is used to perform ...
- Q18. In an image preprocessing experiment, the cv2.imread("lena.png", 1) function provided by O...
- Q19. What are the advantages of deep learning-based speech recognition algorithms?...
- Q20. In cases where the bright and dark areas of an image are too extreme, which of the followi...
- Q21. In 2017, the Google machine translation team proposed the Transformer in their paper Attent...
