Jelena Mitrović
Jecovit
AI & ML interests
NLP, LLMs
Recent Activity
upvoted an article 5 days ago
KV Caching Explained: Optimizing Transformer Inference Efficiency liked a dataset 12 days ago
mteb/WebFAQRetrieval upvoted an article 9 months ago
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval