爱美容
当前位置: 首页 美容百科

ai论文网(爱可可AI论文推介)

时间:2023-05-30 作者: 小编 阅读量: 2 栏目名: 美容百科

与softmax置信打分不同,能量打分可证明与输入密度一致——能量较高的样本,可被解释为发生概率较低的数据,因此不太容易受到过度置信问题的影响,可显著提高检测性能。在CIFAR-10预训练WideResNet上,与softmax置信打分相比,能量打分平均FPR降低了18.03%。使用单GPU和单一环境实例,在相同计算成本和训练时间下,DreamerV2表现超过顶级的无模型单GPU智能体Rainbow和IQN。

LG - 机器学习 CV - 计算机视觉 CL - 计算与语言

1、[LG] *Energy-based Out-of-distribution Detection

W Liu, X Wang, J D. Owens, Y Li

[University of California, San Diego & University of California, Davis & University of Wisconsin-Madison]

用基于能量打分代替softmax置信打分实现分布外检测(OOD),核心思想是用一个非概率的能量函数,将较低值赋予分布内数据,较高值赋予分布外数据。与softmax置信打分不同,能量打分可证明与输入密度一致——能量较高的样本,可被解释为发生概率较低的数据,因此不太容易受到过度置信问题的影响,可显著提高检测性能。能量打分可从纯判别分类模型中导出,不需要显式依赖密度估计器,避免了JEM等生成式模型训练的优化过程。该框架内,能量可以灵活地用作所有预训练神经分类器的评分函数,也可作为可训练代价函数来明确地塑造能量表面,实现OOD检测。在CIFAR-10预训练WideResNet上,与softmax置信打分相比,能量打分平均FPR降低了18.03% 。

Determining whether inputs are out-of-distribution (OOD) is an essential building block for safely deploying machine learning models in the open world. However, previous methods relying on the softmax confidence score suffer from overconfident posterior distributions for OOD data. We propose a unified framework for OOD detection that uses an energy score. We show that energy scores better distinguish in- and out-of-distribution samples than the traditional approach using the softmax scores. Unlike softmax confidence scores, energy scores are theoretically aligned with the probability density of the inputs and are less susceptible to the overconfidence issue. Within this framework, energy can be flexibly used as a scoring function for any pre-trained neural classifier as well as a trainable cost function to shape the energy surface explicitly for OOD detection. On a CIFAR-10 pre-trained WideResNet, using the energy score reduces the average FPR (at TPR 95%) by 18.03% compared to the softmax confidence score. With energy-based training, our method outperforms the state-of-the-art on common benchmarks.

https://weibo.com/1402400261/JoDln8nmD

2、[LG] *Mastering Atari with Discrete World Models

D Hafner, T Lillicrap, M Norouzi, J Ba

[Google Brain & DeepMind & University of Toronto]

基于离散世界模型的强化学习智能体DreamerV2在Atari游戏上达到人类水平,DreamerV2单纯从强大世界模型紧凑潜空间预测中学习如何行动,世界模型采用离散表示,并与策略分开训练。使用单GPU和单一环境实例,在相同计算成本和训练时间下,DreamerV2表现超过顶级的无模型单GPU智能体Rainbow和IQN。

Intelligent agents need to generalize from past experience to achieve goals in complex environments. World models facilitate such generalization and allow learning behaviors from imagined outcomes to increase sample-efficiency. While learning world models from image inputs has recently become feasible for some tasks, modeling Atari games accurately enough to derive successful behaviors has remained an open challenge for many years. We introduce DreamerV2, a reinforcement learning agent that learns behaviors purely from predictions in the compact latent space of a powerful world model. The world model uses discrete representations and is trained separately from the policy. DreamerV2 constitutes the first agent that achieves human-level performance on the Atari benchmark of 55 tasks by learning behaviors inside a separately trained world model. With the same computational budget and wall-clock time, DreamerV2 reaches 200M frames and exceeds the final performance of the top single-GPU agents IQN and Rainbow.

https://weibo.com/1402400261/JoDuK4E6R

3、[CL] *LEGAL-BERT: The Muppets straight out of Law School

I Chalkidis, M Fergadiotis, P Malakasiotis, N Aletras, I Androutsopoulos

[Athens University of Economics and Business & University of Sheffield]

专用于法律领域的BERT模型LEGAL-BERT,聚焦法律领域,探索将BERT模型应用于下游法律任务的方法,旨在协助法律NLP研究、计算法和法律技术应用。将BERT移植到新领域的最佳策略可能不同:开箱即用原始BERT, 或通过对特定领域语料库的额外预训练来适配BERT,以及对特定领域语料库从零开始对BERT进行预训练。文中指出,当针对最终任务微调BERT对性能有重大影响时,应该始终采用扩展网格搜索。

BERT has achieved impressive performance in several NLP tasks. However, there has been limited investigation on its adaptation guidelines in specialised domains. Here we focus on the legal domain, where we explore several approaches for applying BERT models to downstream legal tasks, evaluating on multiple datasets. Our findings indicate that the previous guidelines for pre-training and fine-tuning, often blindly followed, do not always generalize well in the legal domain. Thus we propose a systematic investigation of the available strategies when applying BERT in specialised domains. These are: (a) use the original BERT out of the box, (b) adapt BERT by additional pre-training on domain-specific corpora, and (c) pre-train BERT from scratch on domain-specific corpora. We also propose a broader hyper-parameter search space when fine-tuning for downstream tasks and we release LEGAL-BERT, a family of BERT models intended to assist legal NLP research, computational law, and legal technology applications.

https://weibo.com/1402400261/JoDADieQM

4、[LG] Gradient Flow in Sparse Neural Networks and How Lottery Tickets Win

U Evci, Y A. Ioannou, C Keskin, Y Dauphin

[Google]

用稀疏神经网络梯度流改善初始化过程,试图回答:(1)为什么随机初始化训练非结构化稀疏网络的性能较差;(2)为什么(LT)和动态稀疏训练(DST)例外?实验发现,随机初始化的非结构化稀疏神经网络在初始化时梯度流表现不佳,并提出了一种可分别缩放每个神经元初始方差的替代初始化方法。相比传统稀疏训练方法,DST方法在训练过程中显著改善了梯度流;LT并没有改善梯度流,其成功在于重新学习所源自的剪枝解决方案。

Sparse Neural Networks (NNs) can match the generalization of dense NNs using a fraction of the compute/storage for inference, and also have the potential to enable efficient training. However, naively training unstructured sparse NNs from random initialization results in significantly worse generalization, with the notable exception of Lottery Tickets (LTs) and Dynamic Sparse Training (DST). In this work, we attempt to answer: (1) why training unstructured sparse networks from random initialization performs poorly and; (2) what makes LTs and DST the exceptions? We show that sparse NNs have poor gradient flow at initialization and propose a modified initialization for unstructured connectivity. Furthermore, we find that DST methods significantly improve gradient flow during training over traditional sparse training methods. Finally, we show that LTs do not improve gradient flow, rather their success lies in re-learning the pruning solution they are derived from - however, this comes at the cost of learning novel solutions.

https://weibo.com/1402400261/JoDHMcClb

5、[LG] Online Safety Assurance for Deep Reinforcement Learning

N H. Rotman, M Schapira, A Tamar

[Hebrew University of Jerusalem & Technion]

深度强化学习的在线安全保障。安全部署学习驱动系统,需要实时确定系统行为是否确定(操作环境与训练环境相同),以便在不确定的情况下默认采用合理的启发式,称为在线安全保障问题(OSAP)。本文提出三种量化决策不确定性的方法,根据用来推断不确定性的信号不同而不同。当操作环境和训练环境匹配时,采用深度强化学习方法,但当两者不匹配时,用简单的启发式进行控制。

Recently, deep learning has been successfully applied to a variety of networking problems. A fundamental challenge is that when the operational environment for a learning-augmented system differs from its training environment, such systems often make badly informed decisions, leading to bad performance. We argue that safely deploying learning-driven systems requires being able to determine, in real time, whether system behavior is coherent, for the purpose of defaulting to a reasonable heuristic when this is not so. We term this the online safety assurance problem (OSAP). We present three approaches to quantifying decision uncertainty that differ in terms of the signal used to infer uncertainty. We illustrate the usefulness of online safety assurance in the context of the proposed deep reinforcement learning (RL) approach to video streaming. While deep RL for video streaming bests other approaches when the operational and training environments match, it is dominated by simple heuristics when the two differ. Our preliminary findings suggest that transitioning to a default policy when decision uncertainty is detected is key to enjoying the performance benefits afforded by leveraging ML without compromising on safety.

https://weibo.com/1402400261/JoDOCchd8

    推荐阅读
  • 新手刷酸产品推荐干皮(敲详细刷酸教程)

    敲详细刷酸教程刷酸也有一段时间了今天做了个小小的总结,包含了各个阶段不同的刷酸方法~—先来说一下怎么去选酸新手——维A酸乳膏进阶——水杨酸棉片(0.5%浓度)老手——果酸(30%浓度)—如果之前没刷过酸的话,记得现。

  • 刀锋鞋适合什么人穿(刀锋鞋的原理)

    刀锋鞋适合什么人穿刀锋跑鞋当然适合跑步运动的青年人,不适合体重过大人群,刀锋跑鞋是2013年6月24日在美国波特兰发布一款结构减震跑鞋。鞋底搭载了16个由高科技聚合物制成的叶片,为跑者提供最高的纵向能量反馈。保护膝盖,减少运动消耗,延长运动时间,提高运动效率。

  • 石家庄紧急寻人通告(河北多地最新公告石家庄)

    石家庄紧急寻人通告行程轨迹公布!定州紧急提醒广大市民:2月28日0时至24时,天津市新增6例确诊病例,其中滨海新区4例、东丽区2例。截至目前,本轮疫情累计报告19例阳性感染者,均在定点医院治疗。为保障广大市民身体健康与生命

  • 2.7霸道加92号油可以吗 丰田霸道3.5可以加92号油吗

    有毒苯含量不得超过1%。是的,这款搭载奥迪新一代共轨直喷柴油发动机的A6L2.7TDI绝对值得好奇了解甚至驾驶。经过几天的试驾,这款奥迪A6L2.7TDI的总里程约为960公里,油表显示还能开100公里。从这几天试驾的操控性表现来看,奥迪A6L2.7TDI作为一款豪华行政轿车,其操控性和其他奥迪A6L一样稳健舒适。轮毂方面,A6L2.7TDI配备了与2.4款相同的7辐17英寸轮毂,轮胎尺寸完全相同。

  • 蒸2个鸡蛋要放多少水(这里有答案)

    蒸2个鸡蛋要放多少水?下面更多详细答案一起来看看吧!准备好所有材料,把两个鸡蛋放在碗中打散打匀。在鸡蛋液中加入156ml的饮用水,用筷子继续打匀。打匀后拿筛子把泡沫全部过滤掉,一定要过滤干净。盖上保鲜薄膜,防止水滴进去鸡蛋羹不平。小火蒸12分钟后,把火关掉,继续焖5分钟。五分钟后拿出来后在鸡蛋液上倒上酱油和麻油就可以开动了。

  • 简笔画少女古装美人(人物简笔画女孩可爱古装)

    少女古装美人简笔画你会画吗?儿童画第1页漫画图片简笔美女漫画人物简笔画少女古装古装漫画人物,下面来看看有没有你喜欢的简笔画少女古装美人?接下来我们一起去欣赏以下简笔画作品吧!

  • 如何让蒸蛋又嫩又滑好吃不翻车呢(蒸蛋又嫩又滑做法)

    接下来我们就一起去了解一下吧!如何让蒸蛋又嫩又滑好吃不翻车呢将鸡蛋打入碗里打散,一定要打得很散。加入温开水,边加水边搅拌,多搅拌几分钟。加入适量的盐,继续打散,用小勺撇去上面的一层泡沫不要。碗上盖上盖子,也可以用保鲜膜,蒸锅里加好水烧开,把蒸碗放入蒸锅。盖好盖子,蒸12分钟左右就好了。最后起锅后可加点葱花,芝麻油。

  • 牙膏怎么选择才是好的(如何选择牙膏)

    牙膏怎么选择才是好的试挤,质量良好的牙膏应能自然地挤出,膏体比较细腻光滑,并非正常的圆条状。在阳光反射下看细腻、光滑、洁净、无杂质和发黑现象,而质量低劣的牙膏发硬挤不出,或稍一挤膏体便大量涌出。试尝,将牙膏挤出少许试尝,质量好的牙膏香味纯正,无刺激味道。如果口感粗糙,有像沙子一样的颗粒滞留在嘴里,大多是含粗糙磨擦剂的牙膏,建议立即停用。对症选用药物牙膏,根据防蛀功效选用含氟牙膏。

  • 苹果充电器怎么保护(苹果手机如何充电才保护电池)

    苹果充电器怎么保护由于iPhone使用的是锂聚合物电池,并不存在记忆效应,因此并不需要等到手机彻底没电或者是低电量之后再充电,也就是说随时都可以充电。由于手机内还有一个充放电的计量芯片,每个月只需要完成一次完整的充放电就能够将电池进行校准。建议在充电的时候使用质量合格的充电器以及数据线,由于目前iPhone手机已经支持PD的快充协议,因此使用支持PD的快充头是能够支持快充。

  • 汽车电瓶什么情况下才换(汽车电瓶换完了)

    与电动转向泵通信才是重点发动机电脑自身可是重要部件,它要坏了,可能就打不着车了,所以咱们检查电动转向泵,这个标致3008,电动转向泵油是不缺的,通过诊断电脑我们去查找泵的电路图,搞清楚泵的供电和接地以及信号线。至此真相大明,紧固螺丝,清除故障码,试车正常。我接着说,重点在于你没换电瓶之前,都是好的,她说是的是的,这个标致3008外形时尚,平时开着也省心,我很喜欢这车,但就是换了电瓶之后就这不好,那不好了。