The story begins with a post I saw on Numenta’s forum: make an AI (or, an ML algorithm) play rock, paper, scissors against a human player, learn the human’s pattern, and by doing so dominate the human. Then I thought to myself: why not write two AIs, one in HTM and one using LSTM, and have them compete?
The idea is to have two AI players, one implemented with HTM and the other with an RNN (or LSTM/GRU). Let them learn and predict their opponent’s next move, then act accordingly. Sounds fun and straightforward enough.
The RNN Agent
In this post I’ll be using the broad definition of RNN unless specified otherwise: a NN layer that carries a hidden state from previous timesteps.
Agents in this game are expected to do two things: first, learn the opponent’s pattern; second, predict the opponent’s next move based on the opponent’s past move history. To keep things simple, I’ll be using my favorite DL framework – tiny-dnn. The RNN agent will be a simple 2-layer network receiving a one-hot encoded vector as input (see the sketch below for the encoding).
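For reference, the encoding really is just standard one-hot; a minimal sketch (the helper name one_hot is mine, not from the repo):

#include <xtensor/xarray.hpp>
#include <xtensor/xbuilder.hpp>

// One-hot encode a move (0 = rock, 1 = paper, 2 = scissors)
// into the 3-element vector the RNN consumes.
xt::xarray<float> one_hot(int move)
{
    xt::xarray<float> v = xt::zeros<float>({3});
    v[move] = 1.0f;
    return v;
}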
So, let’s define the network.
#include <tiny_dnn/tiny_dnn.h>
using namespace tiny_dnn;
using namespace tiny_dnn::layers;
using namespace tiny_dnn::activation;

size_t hidden_size = 30;
size_t seq_len = 3; // The length of the training sequence
network<sequential> nn;
nn << recurrent(lstm(3, hidden_size), seq_len);
nn << leaky_relu() << fc(hidden_size, 3) << softmax();
Then the tricky part: the RNN needs to predict and learn at the same time, but tiny-dnn is a static-graph library, so the network can only be trained every 3 steps (the seq_len parameter). So for each step we save the input into a std::vector, predict the opponent’s next move from the current input, and train the network every 3 steps.
// Member variables used below: nn (the network defined above), optimizer_,
// last_input_, and the vec_t buffers input_ and output_ for training data.
xt::xarray<float> compute(xt::xarray<float> input)
{
    assert(input.size() == 3);
    // Save data for training
    if(last_input_.size() != 0) {
        for(auto v : last_input_)
            input_.push_back(v);
        for(auto v : input)
            output_.push_back(v);
    }
    last_input_ = vec_t(input.begin(), input.end());
    // Train once all the needed data has been collected
    if(input_.size() == RNN_DATA_PER_EPOCH) {
        assert(input_.size() == output_.size());
        // Put the network into "training mode"
        nn.at<recurrent_layer>(0).seq_len(RNN_DATA_PER_EPOCH);
        nn.set_netphase(net_phase::train);
        nn.fit<cross_entropy_multiclass>(optimizer_, std::vector<vec_t>({input_}),
            std::vector<vec_t>({output_}), 1, 1, [](){}, [](){});
        // Leave "training mode" and keep on predicting
        nn.set_netphase(net_phase::test);
        nn.at<recurrent_layer>(0).seq_len(1);
        input_.clear();
        output_.clear();
    }
    // Predict the opponent's next move
    vec_t out = nn.predict(vec_t(input.begin(), input.end()));
    assert(out.size() == 3);
    // Convert the prediction to an xarray
    xt::xarray<float> r = xt::zeros<float>({3});
    for(size_t i = 0; i < out.size(); i++)
        r[i] = out[i];
    return r;
}
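With the prediction in hand, the agent still has to choose what to throw. The decision logic isn’t shown here; a minimal sketch of the obvious strategy (pick the opponent’s most probable move and counter it; the function name is mine, not from the repo):

#include <algorithm>
#include <xtensor/xarray.hpp>

// Moves are indexed 0 = rock, 1 = paper, 2 = scissors, so the move that
// beats move m is (m + 1) % 3: paper beats rock, scissors beats paper,
// rock beats scissors.
int decide_move(const xt::xarray<float>& prediction)
{
    // Index of the opponent's most probable next move
    int predicted = std::max_element(prediction.begin(), prediction.end())
                    - prediction.begin();
    // Play the move that beats it
    return (predicted + 1) % 3;
}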
The HTM Agent
The HTM agent is a lot more straightforward. To recap from my previous project: all HTM layers receive a sparse binary tensor, or Sparse Distributed Representation (SDR), as input and generate an SDR representing what the algorithm has learned. And TemporalMemory is the HTM algorithm that learns to predict the next input based on the input sequences it has observed in the past; exactly what I need.
So now, let’s create a TemporalMemory object and set up the hyperparameters. 3*ENCODE_WIDTH is the length of the SDR the layer will receive, and TP_DEPTH is, in broad terms, how many different sequences can potentially trigger an output.
TemporalMemory tm(3*ENCODE_WIDTH, TP_DEPTH);
tm.setMaxNewSynapseCount(64);
tm.setPermanenceIncrement(0.1);
tm.setPermanenceDecrement(0.045);
tm.setConnectedPermanence(0.4);
tm.setPredictedSegmentDecrement(0.3*2.0f*tm.getPermanenceIncrement());
To train and make use of the TemporalMemory layer, simply call the compute() function; it performs learning and prediction automatically.
xt::xarray<float> compute(int last_oppo_move, bool learn = true)
{
    auto out = tm.compute(encode(last_oppo_move), learn); // second argument enables learning
    return categroize(3, ENCODE_WIDTH, out); // Convert from SDR to a probability distribution
}
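encode() and categroize() handle the conversion between moves and SDRs, but aren’t shown above. Here is a minimal sketch of what they plausibly look like, assuming a block-wise encoding where each move owns ENCODE_WIDTH bits (my reconstruction, not the exact code from the repo):

#include <xtensor/xarray.hpp>
#include <xtensor/xbuilder.hpp>

// Map a move (0 = rock, 1 = paper, 2 = scissors) to an SDR of
// 3*ENCODE_WIDTH bits, setting the block that belongs to that move.
xt::xarray<bool> encode(int move)
{
    xt::xarray<bool> sdr = xt::zeros<bool>({3*ENCODE_WIDTH});
    for(size_t i = 0; i < ENCODE_WIDTH; i++)
        sdr[move*ENCODE_WIDTH + i] = true;
    return sdr;
}

// Reverse the process: count the active predicted bits in each block
// and normalize the counts into a probability distribution.
xt::xarray<float> categroize(size_t num_classes, size_t width, const xt::xarray<bool>& sdr)
{
    xt::xarray<float> prob = xt::zeros<float>({num_classes});
    for(size_t i = 0; i < num_classes; i++)
        for(size_t j = 0; j < width; j++)
            prob[i] += sdr[i*width + j] ? 1.0f : 0.0f;
    float total = 0;
    for(size_t i = 0; i < num_classes; i++)
        total += prob[i];
    if(total != 0)
        prob /= total;
    return prob;
}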
Playing the game
Now I have the two agents ready. It’s time for them to play games!
I set the two algorithms to play against each other 200K times. Compile and run… Voilà! Here come the results… There are way fewer draws than wins/losses? I have tried multiple times with different parameters, and it seems to be a consistent trend. Interesting!
The code that lets the algorithms play against each other is quite boring, so I didn’t show it (though a rough sketch of its shape follows below). If you are interested, the source code is available here.
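For completeness, the match loop is roughly this shape (a sketch, not the repo code; act() is a hypothetical wrapper around each agent’s compute() plus the counter-move logic):

// 0 = rock, 1 = paper, 2 = scissors; (a - b + 3) % 3 scores a against b:
// 0 means a draw, 1 means a wins, 2 means b wins.
int rnn_move = 0, htm_move = 0; // arbitrary opening moves
size_t rnn_wins = 0, htm_wins = 0, draws = 0;
for(size_t i = 0; i < 200000; i++) {
    int next_rnn = rnn_agent.act(htm_move); // predict the HTM's move and counter it
    int next_htm = htm_agent.act(rnn_move); // predict the RNN's move and counter it
    rnn_move = next_rnn;
    htm_move = next_htm;
    int result = (rnn_move - htm_move + 3) % 3;
    if(result == 0) draws++;
    else if(result == 1) rnn_wins++;
    else htm_wins++;
}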
Conclusion
I don’t know what conclusion I can draw from this experiment… Theoretically both algorithms should be winning 33% of the time, yet in fact both the LSTM and HTM win around 38% of the time. I can’t find any explanation for this. Nevertheless, TemporalMemory is definitely a valid algorithm for learning and predicting sequences.