Authors: Seonghee Lee, Maho Kohga, Steve Landau, …
Authors: Neha R. Gupta, Jessica Hullman, Hari Subramonyam Paper: https://arxiv.org/abs/2408.10239 Introduction Machine learning (ML) model evaluation traditionally focuses on estimating prediction errors…
Authors: Jiajun Xu, Qun Wang, Yuhang Cao, Baitao Zeng, Sicheng Liu Paper: https://arxiv.org/abs/2408.10230 Introduction In recent years, Virtual Assistants (VAs) such as Amazon’s Alexa,…
Authors: Zhiyong Zhang, Aniket Gupta, Huaizu Jiang, Hanumant Singh Paper: https://arxiv.org/abs/2408.10161 Introduction Optical flow estimation is a critical task in computer vision, enabling…
Authors: Hendrik Alsmeier, Anton Savchenko, Rolf Findeisen Paper: https://arxiv.org/abs/2408.09781 Introduction Model Predictive Control (MPC) has become a cornerstone in various industries, from…
Authors: Florian Grötschla, Joël Mathys, Christoffer Raun, Roger Wattenhofer Paper: https://arxiv.org/abs/2408.11042 Introduction Machine learning has made significant strides in various domains, yet it…
Authors: Poppy Collis, Ryan Singh, Paul F Kinghorn, Christopher L Buckley Paper: https://arxiv.org/abs/2408.10970 Introduction In the realm of artificial intelligence, one of the…
Dr.Academy: A Benchmark for Evaluating Questioning Capability in Education for Large Language Models
Authors: Yuyan Chen, Chenwei Wu, Songzhou Yan, Panjun Liu, Haoyu Zhou, Yanghua Xiao Paper: https://arxiv.org/abs/2408.10947 Introduction Background Large Language Models (LLMs) have shown remarkable performance…
Authors: Xinyu Liu, Ke Jin Paper: https://arxiv.org/abs/2408.10921 MTFinEval: A Comprehensive Multi-domain Chinese Financial Benchmark Introduction In the rapidly evolving field of…
Authors: Baekryun Seong, Jieung Kim, Sang-Ki Ko Paper: https://arxiv.org/abs/2408.10900 Introduction Artificial Intelligence (AI) research has recently been dominated by large language models…