
Java Action Class Code Examples


This article collects typical usage examples of the Java class burlap.mdp.core.action.Action. If you have been wondering what the Action class does, how to use it, or what real-world usage looks like, the curated examples below should help.



The Action class belongs to the burlap.mdp.core.action package. Twenty code examples of the class are shown below, ordered by popularity by default.
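
Before the examples, a quick orientation: Action is BURLAP's interface for a grounded action, and it exposes little more than actionName() and copy(). A minimal sketch of creating and inspecting one, using the SimpleAction implementation from the same package (the action name "north" is illustrative only):

import burlap.mdp.core.action.Action;
import burlap.mdp.core.action.SimpleAction;

public class ActionBasics {
	public static void main(String[] args) {
		//SimpleAction is the simplest concrete Action: an unparameterized action identified by its name
		Action north = new SimpleAction("north");
		System.out.println(north.actionName()); //prints "north"

		//Actions are copyable so models and policies can hold them without aliasing concerns
		Action copy = north.copy();
		System.out.println(copy.actionName().equals(north.actionName())); //prints "true"
	}
}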

Example 1: sample

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public DecisionState sample(State state, Action action) {
    List<StateTransitionProb> reachableStates;
    try {
        reachableStates = stateTransitions(state, action);
    } catch (NullPointerException e) {
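        //no transitions defined for this state-action pair; fall back to the absorbing dead-end state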
        reachableStates = Collections.singletonList(new StateTransitionProb(deadEnd, 1.0));
    }
    Collections.shuffle(reachableStates);

    //sample random roll
    double randomThreshold = Math.random(), sumOfProbability = 0;
    for (StateTransitionProb reachableState : reachableStates) {
        sumOfProbability = sumOfProbability + reachableState.p;
        if (randomThreshold <= sumOfProbability) {
            return ((DecisionState) reachableState.s).copy();
        }
    }
    throw new IndexOutOfBoundsException("No state found!");
}
 
Developer: honzaMaly | Project: kusanagi | Lines: 21 | Source: DecisionModel.java
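
The loop above is a standard roulette-wheel (inverse-transform) draw: accumulate outcome probabilities until the running sum crosses a uniform random threshold. A self-contained sketch of the same pattern, independent of BURLAP; note it falls back to the last outcome when floating-point round-off leaves the sum just under 1.0, where the example above throws instead:

import java.util.List;
import java.util.Random;

class RouletteWheel {
	private static final Random RNG = new Random();

	//draws one index from a list of outcome probabilities that sum to 1.0
	static int sampleIndex(List<Double> probs) {
		double threshold = RNG.nextDouble();
		double cumulative = 0.;
		for (int i = 0; i < probs.size(); i++) {
			cumulative += probs.get(i);
			if (threshold <= cumulative) {
				return i;
			}
		}
		//round-off can leave the total slightly below 1.0; return the last outcome
		return probs.size() - 1;
	}
}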


Example 2: qValue

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public double qValue(State s, Action a) {

	if(this.model.terminal(s)){
		return 0.;
	}

	//what are the possible outcomes?
	List<TransitionProb> tps = ((FullModel)this.model).transitions(s, a);

	//aggregate over each possible outcome
	double q = 0.;
	for(TransitionProb tp : tps){
		//what is reward for this transition?
		double r = tp.eo.r;

		//what is the value for the next state?
		double vp = this.valueFunction.get(this.hashingFactory.hashState(tp.eo.op));

		//add contribution weighted by transition probability and
		//discounting the next state
		q += tp.p * (r + this.gamma * vp);
	}

	return q;
}
 
Developer: jmacglashan | Project: burlap_examples | Lines: 27 | Source: VITutorial.java
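
The aggregation loop is a textbook Bellman backup for a known model: Q(s,a) = Σ_{s'} T(s'|s,a)·[R(s,a,s') + γ·V(s')], with tp.p supplying the transition probability T, tp.eo.r the reward R, and the hashed value-function lookup the next-state value V(s').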


Example 3: qValues

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public List<QValue> qValues(State s) {
	//first get hashed state
	HashableState sh = this.hashingFactory.hashState(s);

	//check if we already have stored values
	List<QValue> qs = this.qValues.get(sh);

	//create and add initialized Q-values if we don't have them stored for this state
	if(qs == null){
		List<Action> actions = this.applicableActions(s);
		qs = new ArrayList<QValue>(actions.size());
		//create a Q-value for each action
		for(Action a : actions){
			//add q with initialized value
			qs.add(new QValue(s, a, this.qinit.qValue(s, a)));
		}
		//store this for later
		this.qValues.put(sh, qs);
	}

	return qs;
}
 
Developer: jmacglashan | Project: burlap_examples | Lines: 24 | Source: QLTutorial.java
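
A typical consumer of qValues is greedy (or ε-greedy) action selection. A minimal sketch, assuming the public a and q fields that BURLAP's QValue class exposes:

import java.util.List;

import burlap.behavior.valuefunction.QValue;
import burlap.mdp.core.action.Action;

class GreedySelection {
	//returns the action with the highest Q-value; ties resolve to the first maximum
	static Action greedy(List<QValue> qs) {
		QValue best = qs.get(0);
		for (QValue q : qs) {
			if (q.q > best.q) {
				best = q;
			}
		}
		return best.a;
	}
}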


Example 4: actionDir

import burlap.mdp.core.action.Action; //import the required package/class
protected int actionDir(Action a){
	int adir = -1;
	if(a.actionName().equals(ACTION_NORTH)){
		adir = 0;
	}
	else if(a.actionName().equals(ACTION_SOUTH)){
		adir = 1;
	}
	else if(a.actionName().equals(ACTION_EAST)){
		adir = 2;
	}
	else if(a.actionName().equals(ACTION_WEST)){
		adir = 3;
	}
	return adir;
}
 
Developer: jmacglashan | Project: burlap_examples | Lines: 17 | Source: ExampleOOGridWorld.java
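
Assuming the ACTION_* constants are compile-time String constants (as they are in the BURLAP tutorials), the same mapping can be written as a switch on the action name; a behaviorally identical sketch:

protected int actionDir(Action a) {
	switch (a.actionName()) {
		case ACTION_NORTH: return 0;
		case ACTION_SOUTH: return 1;
		case ACTION_EAST:  return 2;
		case ACTION_WEST:  return 3;
		default:           return -1; //unknown action name
	}
}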


Example 5: executeAction

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public EnvironmentOutcome executeAction(Action a) {
	State startState = this.currentObservation();
	
	ActionController ac = this.actionControllerMap.get(a.actionName());
	int delay = ac.executeAction(a);
	if (delay > 0) {
		try {
			Thread.sleep(delay);
		} catch(InterruptedException e) {
			e.printStackTrace();
		}
	}
	
	State finalState = this.currentObservation();
	
	this.lastReward = this.rewardFunction.reward(startState, a, finalState);
	
	EnvironmentOutcome eo = new EnvironmentOutcome(startState, a, finalState, this.lastReward, this.isInTerminalState());
	
	return eo;
}
 
Developer: h2r | Project: burlapcraft | Lines: 23 | Source: MinecraftEnvironment.java


Example 6: allApplicableActions

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public List<Action> allApplicableActions(State s) {
	BCAgent a = (BCAgent)((GenericOOState)s).object(CLASS_AGENT);

	List<ObjectInstance> blocks = ((OOState)s).objectsOfClass(HelperNameSpace.CLASS_BLOCK);
	for (ObjectInstance block : blocks) {
		if (HelperActions.blockIsOneOf(Block.getBlockById(((BCBlock)block).type), HelperActions.dangerBlocks)) {
			int dangerX = ((BCBlock)block).x;
			int dangerY = ((BCBlock)block).y;
			int dangerZ = ((BCBlock)block).z;
			if ((a.x == dangerX) && (a.y - 1 == dangerY) && (a.z == dangerZ) || (a.x == dangerX) && (a.y == dangerY) && (a.z == dangerZ)) {
				return new ArrayList<Action>();
			}
		}
	}

	//otherwise we pass check
	return Arrays.<Action>asList(new SimpleAction(typeName));
}
 
Developer: h2r | Project: burlapcraft | Lines: 20 | Source: MinecraftActionType.java


Example 7: publishAction

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public int publishAction(Action a) {
	Timer timer = new Timer();
	PublishTask pt = new PublishTask();
	timer.schedule(pt, 0, this.period);
	if(this.synchronous){
		synchronized(pt) {
			while(!pt.finished()) {
				try {
					pt.wait();
				} catch(InterruptedException e) {
					e.printStackTrace();
				}
			}
		}
	}

	return this.delayTime;
}
 
Developer: h2r | Project: burlap_rosbridge | Lines: 20 | Source: RepeatingActionPublisher.java


Example 8: action

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public Action action(State s) {
	
	if(this.dp == null){
		throw new RuntimeException("The valueFunction used by this Policy is not defined; therefore, the policy is undefined.");
	}
	
	if(this.dp.hasCachedPlanForState(s)){
		Action ga = this.dp.querySelectedActionForState(s);
		//the surrounding if condition should already cover null cases, but double check just to be sure
		if(ga == null){
			throw new PolicyUndefinedException();
		}
		return ga;
	}
	throw new PolicyUndefinedException();
}
 
Developer: jmacglashan | Project: burlap | Lines: 18 | Source: SDPlannerPolicy.java


Example 9: collectDataFrom

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public SARSData collectDataFrom(State s, SampleModel model, int maxSteps, SARSData intoDataset) {
	
	if(intoDataset == null){
		intoDataset = new SARSData();
	}
	
	State curState = s;
	int nsteps = 0;
	boolean terminated = model.terminal(s);
	while(!terminated && nsteps < maxSteps){
		
		List<Action> gas = ActionUtils.allApplicableActionsForTypes(this.actionTypes, curState);
		Action ga = gas.get(RandomFactory.getMapped(0).nextInt(gas.size()));
		EnvironmentOutcome eo = model.sample(curState, ga);
		intoDataset.add(curState, ga, eo.r, eo.op);
		curState = eo.op;
		terminated = eo.terminated;
		nsteps++;
		
	}
	
	
	return intoDataset;
	
}
 
Developer: jmacglashan | Project: burlap | Lines: 27 | Source: SARSCollector.java
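
Usage is typically a single call per rollout. A hedged sketch (initialState, model, and actionTypes are assumed to be constructed elsewhere; UniformRandomSARSCollector is the SARSCollector subclass this method is taken from):

//collect up to 200 uniformly random exploration steps into a fresh dataset
SARSCollector collector = new SARSCollector.UniformRandomSARSCollector(actionTypes);
SARSData dataset = collector.collectDataFrom(initialState, model, 200, null);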


Example 10: estimateQs

import burlap.mdp.core.action.Action; //import the required package/class
/**
 * Estimates and returns the Q-values for this node. Q-values and used state samples are forgotten after this call completes.
 * @return a {@link List} of the estimated Q-values for each action.
 */
public List<QValue> estimateQs(){
	List<Action> gas = SparseSampling.this.applicableActions(this.sh.s());
	List<QValue> qs = new ArrayList<QValue>(gas.size());
	for(Action ga : gas){
		if(this.height <= 0){
			qs.add(new QValue(this.sh.s(), ga, SparseSampling.this.vinit.value(this.sh.s())));
		}
		else{
			double q;
			if(!SparseSampling.this.computeExactValueFunction){
				q = this.sampledQEstimate(ga);
			}
			else{
				q = this.exactQValue(ga);
			}
			
			qs.add(new QValue(this.sh.s(), ga, q));
		}
	}
	
	return qs;
}
 
Developer: jmacglashan | Project: burlap | Lines: 27 | Source: SparseSampling.java


Example 11: allApplicableActions

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public List<Action> allApplicableActions(State s) {

	List <Action> res = new ArrayList<Action>();


	if(!(s instanceof OOState)){
		throw new RuntimeException("Cannot get object-parameterized grounded actions in state, because " + s.getClass().getName() + " does not implement OOState");
	}

	//otherwise need to do parameter binding
	List <List <String>> bindings = OOStateUtilities.getPossibleBindingsGivenParamOrderGroups((OOState)s, this.getParameterClasses(), this.getParameterOrderGroups());

	for(List <String> params : bindings){
		String [] aparams = params.toArray(new String[params.size()]);
		ObjectParameterizedAction ga = this.generateAction(aparams);
		if(this.applicableInState(s, ga)) {
			res.add(ga);
		}
	}

	return res;

}
 
Developer: jmacglashan | Project: burlap | Lines: 25 | Source: ObjectParameterizedActionType.java


Example 12: action

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public Action action(State s) {

	synchronized(this){
		while(this.nextAction == null){
			try {
				this.wait();
			} catch(InterruptedException e) {
				e.printStackTrace();
			}
		}
	}
	Action toTake = this.nextAction;
	this.nextAction = null;
	return toTake;
}
 
Developer: jmacglashan | Project: burlap | Lines: 17 | Source: ManualAgentsCommands.java


Example 13: UCTStateNode

import burlap.mdp.core.action.Action; //import the required package/class
/**
 * Initializes the UCT state node.
 * @param s the state that this node wraps
 * @param d the depth of the node
 * @param actionTypes the possible OO-MDP actions that can be taken
 * @param constructor a {@link UCTActionNode} factory that can be used to create ActionNodes for each of the actions.
 */
public UCTStateNode(HashableState s, int d, List <ActionType> actionTypes, UCTActionConstructor constructor){
	
	state = s;
	depth = d;
	
	n = 0;
	
	actionNodes = new ArrayList<UCTActionNode>();

	List<Action> actions = ActionUtils.allApplicableActionsForTypes(actionTypes, s.s());
	for(Action a : actions){
		UCTActionNode an = constructor.generate(a);
		actionNodes.add(an);
	}

}
 
Developer: jmacglashan | Project: burlap | Lines: 24 | Source: UCTStateNode.java


Example 14: getAgentSynchronizedActionSelection

import burlap.mdp.core.action.Action; //import the required package/class
/**
 * This method returns the action for a single agent by a synchronized sampling of this joint policy,
 * which enables multiple agents to query this policy object and act according to the same selected joint
 * actions from it. This is useful when decisions are made by a "referee" who selects the joint action
 * that dictates the behavior of each agent. The synchronization is implemented by selecting a joint action.
 * Each time an agent queries for their action, it is drawn from the previously sampled joint action.
 * A new joint action is only selected after each agent defined in this object's {@link #agentsInJointPolicy} member
 * has queried this method for their action or until an action for a different state is queried (that is, *either* condition
 * will cause the joint action to be resampled).
 * @param agentNum the agent whose action in this joint policy is being queried
 * @param s the state in which the action is to be selected.
 * @return the single-agent action to be taken according to the synchronized joint action that was selected.
 */
public Action getAgentSynchronizedActionSelection(int agentNum, State s){
	
	if(this.lastSyncedState == null || !this.lastSyncedState.equals(s)){
		//then reset synchronization
		this.lastSyncedState = s;
		this.agentsSynchronizedSoFar.clear();
		this.lastSynchronizedJointAction = (JointAction)this.action(s);
	}
	
	Action a = this.lastSynchronizedJointAction.action(agentNum);
	this.agentsSynchronizedSoFar.add(agentNum);
	if(this.agentsSynchronizedSoFar.size() == this.agentsInJointPolicy.size()){
		//then we're finished getting the actions for all agents and enable the next query
		this.lastSyncedState = null;
		this.agentsSynchronizedSoFar.clear();
	}
	
	return a;
	
}
 
Developer: jmacglashan | Project: burlap | Lines: 34 | Source: JointPolicy.java


Example 15: policyDistribution

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public List<ActionProb> policyDistribution(State s) {

	if(!(this.sourcePolicy instanceof EnumerablePolicy)){
		throw new RuntimeException("Cannot return policy distribution because source policy does not implement EnumerablePolicy");
	}

	List<Action> unmodeled = KWIKModel.Helper.unmodeledActions(model, allActionTypes, s);

	if(!unmodeled.isEmpty()){
		List<ActionProb> aps = new ArrayList<ActionProb>(unmodeled.size());
		double p = 1./(double)unmodeled.size();
		for(Action ga : unmodeled){
			aps.add(new ActionProb(ga, p));
		}
		return aps;
	}

	return ((EnumerablePolicy)this.sourcePolicy).policyDistribution(s);
}
 
Developer: jmacglashan | Project: burlap | Lines: 21 | Source: UnmodeledFavoredPolicy.java


Example 16: sample

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public State sample(State s, Action a) {

	s = s.copy();

	double [] directionProbs = transitionDynamics[actionInd(a.actionName())];
	double roll = rand.nextDouble();
	double curSum = 0.;
	int dir = 0;
	for(int i = 0; i < directionProbs.length; i++){
		curSum += directionProbs[i];
		if(roll < curSum){
			dir = i;
			break;
		}
	}

	int [] dcomps = movementDirectionFromIndex(dir);
	return move(s, dcomps[0], dcomps[1]);

}
 
Developer: jmacglashan | Project: burlap | Lines: 22 | Source: GridWorldDomain.java
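
This is the same cumulative-probability draw as in Example 1, applied to the four movement directions: transitionDynamics[actionInd(name)] holds one probability row per nominal action, so a "north" command can stochastically slip to another direction.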


Example 17: reward

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public double reward(State s, Action a, State sprime) {

	double [] features;
	if(this.rfFeaturesAreForNextState){
		features = this.rfFvGen.features(sprime);
	}
	else{
		features = this.rfFvGen.features(s);
	}
	double sum = 0.;
	for(int i = 0; i < features.length; i++){
		sum += features[i] * this.parameters[i];
	}
	return sum;

}
 
Developer: jmacglashan | Project: burlap | Lines: 18 | Source: LinearDiffRFVInit.java


Example 18: getNode

import burlap.mdp.core.action.Action; //import the required package/class
/**
 * Returns the policy node that stores the action preferences for state.
 * @param sh The (hashed) state of the {@link BoltzmannActor.PolicyNode} to return
 * @return the {@link BoltzmannActor.PolicyNode} object for the given input state.
 */
protected PolicyNode getNode(HashableState sh){
	
	List<Action> gas = ActionUtils.allApplicableActionsForTypes(this.actionTypes, sh.s());
	
	PolicyNode node = this.preferences.get(sh);
	if(node == null){
		node = new PolicyNode(sh);
		for(Action ga : gas){
			node.addPreference(new ActionPreference(ga, 0.0));
		}
		this.preferences.put(sh, node);
	}
	
	return node;
	
}
 
Developer: jmacglashan | Project: burlap | Lines: 23 | Source: BoltzmannActor.java


Example 19: sample

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public State sample(State s, Action a) {

	s = s.copy();

	double baseForce = 0.;
	if(a.actionName().equals(CartPoleDomain.ACTION_LEFT)){
		baseForce = -physParams.actionForce;
	}
	else if(a.actionName().equals(CartPoleDomain.ACTION_RIGHT)){
		baseForce = physParams.actionForce;
	}


	double roll = RandomFactory.getMapped(0).nextDouble() * (2 * physParams.actionNoise) - physParams.actionNoise;
	double force = baseForce + roll;

	return updateState(s, force);
}
 
Developer: jmacglashan | Project: burlap | Lines: 20 | Source: IPModel.java
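
The roll maps a uniform draw on [0,1) onto [-actionNoise, +actionNoise), so the applied force is the base force plus zero-mean uniform noise: force = baseForce + U(-actionNoise, actionNoise).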


Example 20: evaluate

import burlap.mdp.core.action.Action; //import the required package/class
@Override
public double evaluate(State s, Action a) {

	List<StateFeature> features = this.stateActionFeatures.features(s, a);
	double val = 0.;
	for(StateFeature sf : features){
		double prod = sf.value * this.getWeight(sf.id);
		val += prod;
	}
	this.currentValue = val;
	this.currentGradient = null;
	this.currentFeatures = features;
	this.lastState = s;
	this.lastAction = a;
	return val;
}
 
Developer: jmacglashan | Project: burlap | Lines: 17 | Source: LinearVFA.java
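
The returned value is a sparse dot product over the active state-action features, v(s,a) = Σ_i φ_i(s,a)·w_i; caching currentFeatures, lastState, and lastAction lets a subsequent gradient query for the same (s,a) reuse this evaluation rather than recompute the feature vector.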



Note: the burlap.mdp.core.action.Action examples in this article were collected from source-code and documentation hosting platforms such as GitHub. The snippets come from open-source projects and copyright remains with the original authors; consult each project's license before redistributing or reusing the code. Please do not republish this article without permission.

