A. They use contextual embeddings
B. They are sequence-dependent
C. They allow for parallel processing
D. They rely on sequence ordering
Explanation:
C is correct. Transformers process all positions of a sequence in parallel rather than one step at a time, which makes them computationally efficient, especially on large text datasets.
A is incorrect because, although transformers use contextual embeddings, this does not relate directly to computational efficiency.
B is incorrect because RNNs, not transformers, are sequence-dependent.
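D is incorrect because transformers do not rely on processing tokens in strict sequential order: the attention mechanism relates all positions at once, and order information is supplied separately through positional encodings.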
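The contrast behind answer C can be illustrated with a small sketch. It is a toy example under stated assumptions, not a real transformer or RNN: the helper names (attention_at, rnn_states), the decay parameter, and the input vectors are all hypothetical, and the attention here uses the inputs directly as queries, keys, and values with no learned weights. The point is only that each attention output depends on the inputs alone, so positions can be computed in any order or concurrently, while each recurrent state needs the previous one.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def softmax(scores):
    # Subtract the max for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_at(i, xs):
    """Toy self-attention output for position i (inputs act as Q, K, and V)."""
    d = len(xs[i])
    weights = softmax([dot(xs[i], x) / math.sqrt(d) for x in xs])
    return [sum(w * x[k] for w, x in zip(weights, xs)) for k in range(d)]


def rnn_states(xs, decay=0.5):
    """Toy recurrence h_t = tanh(decay * h_{t-1} + x_t); steps must run in order."""
    h = [0.0] * len(xs[0])
    states = []
    for x in xs:
        h = [math.tanh(decay * hp + xk) for hp, xk in zip(h, x)]
        states.append(h)
    return states


# Hypothetical 3-token sequence of 2-dimensional vectors.
xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Attention: no position waits on another, so any order (or a thread pool) works.
forward = [attention_at(i, xs) for i in range(len(xs))]
with ThreadPoolExecutor() as pool:
    parallel = list(pool.map(lambda i: attention_at(i, xs), range(len(xs))))

# Recurrence: state t cannot be computed before state t-1.
states = rnn_states(xs)

print(forward == parallel)
```

In practice the parallelism comes from batched matrix multiplications on accelerators rather than threads; the thread pool here only stands in for "these computations are independent".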