Govur University Logo
--> --> --> -->
...

When employing a cross-encoder for re-ranking retrieved documents, why does the architectural requirement to process the query and document simultaneously preclude the use of pre-computed embeddings?



A cross-encoder processes a query and a document by feeding them into a neural network together as a single input sequence. Within this architecture, the model uses self-attention mechanisms to allow every word in the query to interact directly with every word in the document at the earliest layers of the network. This deep, token-level interaction is w....

Log in to view the answer



Redundant Elements