Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks

Published in NAACL, 2024