1.

赵钊原培新唐俊文陈锦林《东北大学学报(自然科学版)》2023,(11):1548-1555

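To address the exploration difficulties of traditional skill-discovery algorithms such as SNN-HRL, this paper proposes MES-HRL, a hierarchical reinforcement learning algorithm built on SNN-HRL that fuses multiple exploration strategies. It revises the traditional hierarchical structure into three layers: trajectory exploration, trajectory learning, and path planning. In the trajectory exploration layer, the agent is trained to explore as much of the unknown environment as possible, providing sufficient environment state information for subsequent training. In the trajectory learning layer, the training results of the exploration layer are used as "prior knowledge" for training this layer, which improves training efficiency. In the path planning layer, the agent uses the skills it has previously acquired to complete the path planning task. Simulations compare the performance of MES-HRL and SNN-HRL in different environments; the results show that MES-HRL resolves the exploration problem of the traditional algorithm and plans paths better.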

2.

A 3D Game Control Algorithm Based on Reinforcement Learning

孟琭沈凝祁殷俏张昊园《东北大学学报(自然科学版)》2021,42(4):478-483

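Based on reinforcement learning, an agent is designed for the 3D first-person shooter game DOOM; the agent can move, shoot enemies, and collect items in the game environment. The algorithm combines the deep-learning object detection algorithm Faster RCNN with Deep Q-Networks (DQN), which greatly reduces the search space of DQN and thereby greatly improves training efficiency. Experiments are run in two scenarios (Defend_the_center and Health_gathering) of the virtual game platform ViZDoom, comparing the algorithm with the latest 3D shooter agent algorithms. The results show that it achieves better training results in fewer iterations.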

3.

A Virtual Simulation Environment for Autonomous Learning

钟方威王亦洲《中国传媒大学学报》2021,28(1):6-10

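Giving agents the ability to learn autonomously through interaction with the environment is key to realizing the next generation of artificial intelligence. In this paper, we introduce a virtual simulation environment based on Unreal Engine 4 for training and testing autonomous agents. The environment is highly realistic, interactive, flexible, and general-purpose, so agents can explore it freely and autonomously learn abilities such as scene perception, commonsense reasoning, and decision-making and control. To verify the usability of the environment, we demonstrate experimentally how to build autonomous intelligence in it, namely by using reinforcement learning to train end-to-end neural networks for vision-based target search and target tracking.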

4.

Research on NPC Behavior Modeling Based on Swarm Intelligence

魏凌华张栋冰范祺《淮北煤炭师范学院学报(自然科学版)》2014,(3):62-65

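The introduction of artificial intelligence techniques raises the intelligence of NPCs in games, but NPCs cannot communicate with one another, which greatly weakens their ability to complete tasks cooperatively and reduces playability. To address NPC cooperation, a cooperative algorithm based on swarm intelligence is proposed. The algorithm adds cooperative processing between NPCs, effectively solves the NPC cooperation problem, and improves playability; its feasibility is demonstrated in terms of design and time.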

5.

An Improved Deep Reinforcement Learning Algorithm Based on Meta-Learning

《扬州大学学报(自然科学版)》2021,(3)

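Traditional deep reinforcement learning algorithms require extensive interaction with the environment and have high sample complexity, so agents take a long time to train and the algorithms are hard to converge, which limits their use in practical problems. To address this, the idea of meta-learning is incorporated into the process by which the agent updates model parameters through gradient descent, yielding an improved deep reinforcement learning algorithm that lets the agent use prior knowledge learned on training tasks to adapt quickly to new tasks. Simulation results show that the improved algorithm enables fast adaptation to new tasks and that its convergence speed and stability are better than those of traditional algorithms.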

6.

Characteristics of Two-Lane Mixed Traffic Flow Based on a Reinforced Cooperative Game Approach

郭静秋方守恩曲小波王亦兵刘洋泽西《同济大学学报(自然科学版)》2019,47(7):0976-0983

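The Gipps car-following model is introduced into a cellular automaton and combined with an improved Q-learning method to build microscopic driving strategies for regular vehicles and intelligent connected vehicles, respectively, yielding a new simulation method for the evolution of mixed traffic flow. Numerical simulation of a two-lane traffic environment is then used to explore the dynamic impact of intelligent connected vehicles on mixed traffic flow. The results show that, compared with regular-vehicle agents built with the cellular automaton, intelligent connected vehicle agents trained with the improved Q-learning method adapt better to continuous spatiotemporal environments. In the two-lane environment, road capacity increases with the penetration rate of intelligent connected vehicles, by up to 45.34%. In addition, a higher penetration rate reduces inefficient lane changing within vehicle groups and widens the range of vehicle densities over which high capacity is maintained, which helps relieve traffic congestion.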

7.

Behavior Decision-Making and Control of Mobile Robots Based on Neural Networks and Fuzzy Inference

彭刚黄心汉杨涛高健伍翼熊有伦《华中科技大学学报(自然科学版)》2004,(Z1)

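Using a soccer robot system as the experimental platform, and addressing practical problems in intelligent decision-making for mobile robots, a robot behavior decision method based on a radial basis function neural network is proposed. Through neuron learning and training and the network's own generalization ability, the method makes good use of multi-source information for behavior decisions, improving their effectiveness. To ensure that decisions are carried out effectively, fuzzy inference is combined with conventional PID control, which both ensures the accuracy and stability of the mobile robot's motion control and shortens the dynamic adjustment time, achieving good control performance.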

8.

Deep Reinforcement Learning Robot Path Planning Under Depth-of-Field Constraints

《华中科技大学学报(自然科学版)》2018,(12)

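To improve the exploration ability of mobile robots in unknown environments, a selective training mode based on minimum depth information is proposed for deep reinforcement learning. Kinematic equation constraints optimize the search and sampling of the state space and increase training speed. In a simulated unknown environment, the depth image from an RGB-D sensor is used as the robot's state input, and the learned model directly outputs the robot's speed and angle to make motion decisions, which validates the robot's path planning control policy. The results show that, for the same training time, the proposed training mode explores unknown environments better.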

9.

An Environment-Upgrade Reinforcement Learning Method for Non-Differentiable Multi-Stage Algorithms

谢树钦陈梓天徐超卢策吾《重庆邮电大学学报(自然科学版)》2020,32(5):857-858

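Research on multi-stage algorithms has made great progress, but two important problems remain. At inference time, information cannot be fed back from downstream to upstream. At training time, end-to-end training is impossible when the overall model involves non-differentiable functions, so the stages cannot be jointly optimized. A new environment-upgrade reinforcement learning method is proposed to solve the feedback and joint optimization problems. Its framework reconnects the downstream stage to the upstream stage through a reinforcement learning agent: the agent is trained by optimizing the output of the upstream stage to improve final performance, while the downstream stage (the environment) is upgraded according to the agent's policy, achieving joint optimization of the agent's policy and the environment. A training algorithm based on this framework is also proposed to meet the different training needs of the agent and the environment, and its effectiveness is demonstrated in instance segmentation and human pose estimation experiments.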

10.

A Game Confrontation Algorithm for Combat Entities Based on Proximal Policy Optimization

张振黄炎焱张永亮陈天德《南京理工大学学报(自然科学版)》2021,45(1):77-83

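In a wargame confrontation environment with a large map and sparse rewards, a pure deep reinforcement learning algorithm fails to converge quickly in training and gives the agent a low win rate against specific rule-based agents. To address this, a method that combines supervised learning with deep reinforcement learning and sets additional rewards is proposed, aiming to improve the training of intelligent game playing. The agent is trained with supervised learning; a study based on proximal policy optimization (Proximal policy optimiz...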

11.

Markedness and its Interpretation in Language Use

张瑞华《井冈山学院学报》2007,28(Z1)

Language markedness is a common phenomenon in languages, and is reflected in hearing, vision and sense, i.e., in variation across three aspects: phonology, morphology and semantics. This paper focuses on the interpretation of markedness in language use from three perspectives, i.e., pragmatic interpretation, psychological interpretation and cognitive interpretation, with the aim of defining the function of markedness.

12.

Features of Women's language

何延凌《科技信息》2008,(4):258-258

Language is a means of verbal communication. People use language to communicate with each other. In society, no two speakers are exactly alike in their way of speaking. Some differences are due to age, gender, status and personality. Above all, gender is one of the most obvious reasons. This paper describes the features of women's language from these perspectives: pronunciation, intonation, diction, subjects, grammar and discourse. The discussion of these features suggests that more attention should be paid to language use in social context. Moreover, the linguistic phenomena in a speech community can then be understood more thoroughly.

13.

An Analysis of Catherine's Character: Wuthering Heights

王慧《科技信息》2008,(10):240-240

Wuthering Heights, Emily Bronte's only novel, was published in December of 1847 under the pseudonym Ellis Bell. The book did not gain immediate success, but it is now considered one of the finest novels in the English language. Catherine is the key character of this masterpiece, because everybody and everything centers on her even though her life was short. We can understand the novel better if we know Catherine well.

14.

Lower Paleozoic oil relationships within Williston Basin, Canada

Stephen L. Bend Mauri C. Smith 《成都理工大学学报(自然科学版)》2004,31(6)

The Williston Basin is a significant petroleum province, with oil production zones in the Middle Cambrian to Lower Ordovician, Upper Ordovician, Middle Devonian, Upper Devonian and Mississippian, and within the Jurassic and Cretaceous. The oils of the Williston Basin exhibit a wide range of geochemical characteristics defined as "oil families", although the geochemical signature of the oils reservoired in the Cambrian Deadwood Formation and the Lower Ordovician Winnipeg Formation does not match any "oil family". Despite their close stratigraphic proximity, it is evident that the oils of the Lower Palaeozoic within the Williston Basin are distinct. This suggests the presence of a new "oil family" within the Williston Basin. Diagnostic geochemical signatures occur in the gasoline range chromatograms, saturate fraction gas chromatograms and biomarker fingerprints. However, some of the established criteria and cross-plots currently used to segregate oils into distinct genetic families within the basin do not always succeed, particularly when applied to the Lower Palaeozoic oils of the Deadwood and Winnipeg Formations.

15.

Study of the Threshold Pressure Gradient in Low-Permeability Heterogeneous Sandstone Reservoirs

刘丽《科学技术与工程》2015,15(3)

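Combining theoretical derivation with laboratory experiments, a method for determining the threshold pressure gradient of low-permeability heterogeneous sandstone reservoirs is established. First, using the analogy between reservoir flow fields and electric fields, a formula for the threshold pressure gradient of heterogeneous sandstone reservoirs is derived. Second, a test method for the threshold pressure gradient of heterogeneous sandstone reservoirs is established based on steady-state flow experiments. The results show that determining the threshold pressure gradient of such reservoirs follows two equivalence principles. For an areally heterogeneous reservoir, the threshold pressure gradient equals the length-weighted average of the threshold pressure gradients of the individual permeability segments; for a vertically heterogeneous reservoir, it equals the average of the threshold pressure gradients of the individual permeability layers, weighted by the product of permeability and flow cross-sectional area. The findings can guide the choice of reasonable well spacing in low-permeability heterogeneous sandstone reservoirs and promote their efficient development.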

16.

On Tragic Consciousness In The Works By Ernest Hemingway

蒋建华《科技信息》2008,(18)

A modern American novelist famous in the literary world, Hemingway was not someone who followed trends but a sharp observer. He was also a master of tragedy who paid close attention to existence, fate and final outcomes. The tragedy of the characters in his works is a tragedy of extreme limits, in the sense of a fearless challenge that fails. The beauty of tragedy arises not from the destruction of life but from the value shown in the act of struggle. His characters enact for the reader the tragedy of challenging limits and death.

17.

Design and analysis of generic LBS application developing platform

XIA Ying GE Jun-wei BAE Hae-young 《重庆邮电大学学报(自然科学版)》2004,16(5)

Location-based services are promising because of their novel working style and content. A software platform is proposed that provides application programs for typical location-based services and supports efficient development of new applications. The analysis shows that this scheme is easy to implement, low in cost, and adaptable to all kinds of mobile network systems.

18.

Achieving Unusual States of Matter Under High Pressure

Miao Maosheng 《华南师范大学学报(自然科学版)》2014,(6)

The periodicity of the elements and the non-reactivity of the inner-shell electrons are two related principles of chemistry, rooted in the atomic shell structure. Within compounds, Group I elements, for example, invariably assume the +1 oxidation state, and their chemical properties differ completely from those of the p-block elements. These general rules govern our understanding of chemical structures and reactions. Using first principles calcula-

19.

Exchange-Correlation and Electronic Excitation Energies from Pairing Matrix Fluctuations

Yang Weitao 《华南师范大学学报(自然科学版)》2014,46(6):137-138

We have developed an adiabatic connection to formulate the ground-state exchange-correlation energy in terms of pairing matrix linear fluctuations. This formulation of the exchange-correlation energy opens a new channel for density functional approximations based on many-body perturbation theory. We illustrate the potential of such approaches with an approximation based on the particle-particle Random Phase Approximation (pp-RPA). This re-

20.

Density Functional Theory for the Response of Periodic Systems to Electric Fields Based on the Vector Potential Approach

Bernard Kirtman 《华南师范大学学报(自然科学版)》2014,(6)

The electronic and nuclear (structural/vibrational) response of 1D-3D nanoscale systems to electric fields gives rise to a host of optical, mechanical, spectral, etc. properties that are of high theoretical and applied interest. Due to the computational difficulty of treating such large systems it is convenient to model them as infinite and periodic (at least, in first approximation). The fundamental theoretical/computational problem in doing so is that