Abstract: Large language models (LLMs) have enabled rich conversations across domains, but current interfaces follow linear dialogue structures that limit user control during exploration. Users often ...
Abstract: Large language models (LLMs) face significant deployment challenges due to their substantial memory and computational demands. While low-precision quantization offers a promising solution, ...