Introduction to online optimization: introduction

online learning protocol:

图片说明
characteristic:
limited feedback

Exponentially weighed average forecaster

Bounded convex loss and expert regret

Hoeffding’s's inequality
lemma 2.1:


Therorem: For any convex loss taking values in [0,1], the Exp strategy satisfies:

Exp-concave loss and expert regret

Lower bound

General convex and bounded loss is unimprovable.


Anytime strategy

For any convex loss with values in [0,1], the exp strategy with time-varying parameter satisfies for all :

Subdifferentiable loss with bounded subgradient

online finite optimization

For any loss with values in [0,1], the finite Exp strategy with parameter satisfies with probability at least :

全部评论

相关推荐

12-15 14:16
门头沟学院 Java
回家当保安:发offer的时候会背调学信网,最好不要这样。 “27届 ”和“28届以下 ”公司招聘的预期是不一样的。
实习简历求拷打
点赞 评论 收藏
分享
评论
点赞
收藏
分享

创作者周榜

更多
牛客网
牛客网在线编程
牛客网题解
牛客企业服务