<?xml version="1.0" encoding="UTF-8" standalone="no"?><rss xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" version="2.0">

<channel>
	<title>DatumBox</title>
	<atom:link href="http://blog.datumbox.com/feed/" rel="self" type="application/rss+xml"/>
	<link>https://blog.datumbox.com</link>
	<description>Machine Learning and Software Development Blog</description>
	<lastBuildDate>Tue, 20 May 2025 17:42:17 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	
	<item>
		<title>VernamVeil: A Fresh Take on Function-Based Encryption</title>
		<link>https://blog.datumbox.com/vernamveil-a-fresh-take-on-function-based-encryption/</link>
					<comments>https://blog.datumbox.com/vernamveil-a-fresh-take-on-function-based-encryption/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sat, 26 Apr 2025 14:56:04 +0000</pubDate>
				<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">https://blog.datumbox.com/?p=1063</guid>

					<description><![CDATA[Cryptography often feels like an ancient dark art, full of math-heavy concepts, rigid key sizes, and strict protocols. But what if you could rethink the idea of a &#8220;key&#8221; entirely? What if the key wasn’t a fixed blob of bits, but a living, breathing function? VernamVeil is an experimental cipher that explores exactly this idea. [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/vernamveil-a-fresh-take-on-function-based-encryption/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>The journey of Modernizing TorchVision – Memoirs of a TorchVision developer – 3</title>
		<link>https://blog.datumbox.com/the-journey-of-modernizing-torchvision-memoirs-of-a-torchvision-developer-3/</link>
					<comments>https://blog.datumbox.com/the-journey-of-modernizing-torchvision-memoirs-of-a-torchvision-developer-3/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sat, 21 May 2022 16:27:24 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">https://blog.datumbox.com/?p=1048</guid>

					<description></description>
		
					<wfw:commentRss>https://blog.datumbox.com/the-journey-of-modernizing-torchvision-memoirs-of-a-torchvision-developer-3/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>A sneak peek at TorchVision v0.11 – Memoirs of a TorchVision developer – 2</title>
		<link>https://blog.datumbox.com/a-sneak-peek-on-torchvision-v0-11-memoirs-of-a-torchvision-developer-2/</link>
					<comments>https://blog.datumbox.com/a-sneak-peek-on-torchvision-v0-11-memoirs-of-a-torchvision-developer-2/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sun, 10 Oct 2021 12:49:19 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">https://blog.datumbox.com/?p=1032</guid>

					<description><![CDATA[The last couple of weeks were super busy in “PyTorch Land” as we are frantically preparing the release of PyTorch v1.10 and TorchVision v0.11. In this 2nd instalment of the series, I’ll cover some of the upcoming features that are currently included in the release branch of TorchVision. Disclaimer: Though the upcoming release is packed [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/a-sneak-peek-on-torchvision-v0-11-memoirs-of-a-torchvision-developer-2/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>New Blog series – Memoirs of a TorchVision developer</title>
		<link>https://blog.datumbox.com/new-blog-series-memoirs-of-a-torchvision-developer/</link>
					<comments>https://blog.datumbox.com/new-blog-series-memoirs-of-a-torchvision-developer/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sat, 21 Aug 2021 12:33:20 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">https://blog.datumbox.com/?p=1017</guid>

					<description><![CDATA[I’m starting a new blog post series about the development of PyTorch’s computer vision library. I plan to discuss interesting upcoming features primarily from TorchVision and secondary from the PyTorch ecosystem. My target is to highlight new and in-development features and provide clarity of what’s happening in between the releases. Though the format is likely [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/new-blog-series-memoirs-of-a-torchvision-developer/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>How to take S3 backups with DejaDup on Ubuntu 20.10</title>
		<link>https://blog.datumbox.com/how-to-take-s3-backups-with-dejadup-on-ubuntu-20-10/</link>
					<comments>https://blog.datumbox.com/how-to-take-s3-backups-with-dejadup-on-ubuntu-20-10/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sun, 18 Oct 2020 18:27:25 +0000</pubDate>
				<category><![CDATA[Linux]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=974</guid>

					<description><![CDATA[DejaDup is the default backup application for Gnome. It’s a GUI for duplicity, focuses on simplicity, supports incremental encrypted backups and up until recently supported a large number of cloud providers. Unfortunately as of version 42.0, all major cloud providers have been removed. Thus given that Ubuntu 20.10 ships with the specific version, any user [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/how-to-take-s3-backups-with-dejadup-on-ubuntu-20-10/feed/</wfw:commentRss>
			<slash:comments>3</slash:comments>
		
		
			</item>
		<item>
		<title>Datumbox Machine Learning Framework v0.8.2 released</title>
		<link>https://blog.datumbox.com/datumbox-machine-learning-framework-v0-8-2-released/</link>
					<comments>https://blog.datumbox.com/datumbox-machine-learning-framework-v0-8-2-released/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Wed, 05 Aug 2020 21:10:12 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=960</guid>

					<description><![CDATA[The Datumbox Framework v0.8.2 has been released! Download it now from GitHub or Maven Central Repository. What is new? The version 0.8.2 is a limited incremental release that focuses on resolving bugs and updating the dependencies of the framework. Here are the details: Bug Fixes: Resolved an issue on ShapiroWilk which led to the incorrect [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/datumbox-machine-learning-framework-v0-8-2-released/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>How to get around Dropbox’s symlink limitations on Linux</title>
		<link>https://blog.datumbox.com/how-to-get-around-dropbox-symlink-limitations-on-linux/</link>
					<comments>https://blog.datumbox.com/how-to-get-around-dropbox-symlink-limitations-on-linux/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sat, 22 Feb 2020 14:15:56 +0000</pubDate>
				<category><![CDATA[Linux]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=925</guid>

					<description><![CDATA[As of mid-2019, Dropbox announced that they no longer support symlinks that point outside of the main Dropbox folder. In this tutorial, we show a workaround on Linux that enables us to store in Dropbox any file, even if it is not located within the main Dropbox folder. What is the limitation and why it’s [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/how-to-get-around-dropbox-symlink-limitations-on-linux/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
		<item>
		<title>The Batch Normalization layer of Keras is broken</title>
		<link>https://blog.datumbox.com/the-batch-normalization-layer-of-keras-is-broken/</link>
					<comments>https://blog.datumbox.com/the-batch-normalization-layer-of-keras-is-broken/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Tue, 17 Apr 2018 20:59:57 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=890</guid>

					<description><![CDATA[UPDATE: Unfortunately my Pull-Request to Keras that changed the behaviour of the Batch Normalization layer was not accepted. You can read the details here. For those of you who are brave enough to mess with custom implementations, you can find the code in my branch. I might maintain it and merge it with the latest [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/the-batch-normalization-layer-of-keras-is-broken/feed/</wfw:commentRss>
			<slash:comments>31</slash:comments>
		
		
			</item>
		<item>
		<title>5 tips for multi-GPU training with Keras</title>
		<link>https://blog.datumbox.com/5-tips-for-multi-gpu-training-with-keras/</link>
					<comments>https://blog.datumbox.com/5-tips-for-multi-gpu-training-with-keras/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sun, 21 Jan 2018 16:19:00 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=866</guid>

					<description><![CDATA[Deep Learning (the favourite buzzword of late 2010s along with blockchain/bitcoin and Data Science/Machine Learning) has enabled us to do some really cool stuff the last few years. Other than the advances in algorithms (which admittedly are based on ideas already known since 1990s aka “Data Mining era”), the main reasons of its success can [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/5-tips-for-multi-gpu-training-with-keras/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
			</item>
		<item>
		<title>Ubuntu 17.10: a last minute review</title>
		<link>https://blog.datumbox.com/ubuntu-17-10-a-last-minute-review/</link>
					<comments>https://blog.datumbox.com/ubuntu-17-10-a-last-minute-review/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sun, 08 Oct 2017 22:49:23 +0000</pubDate>
				<category><![CDATA[Linux]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=824</guid>

					<description><![CDATA[On October 19 2017, Ubuntu 17.10 will be released and as many of you know it packs lots of significant changes. I spend a week testing the Beta 2 and in this “last minute” review, I document some of the less obvious features/gotchas of Ubuntu 17.10. I also share with you my experience and provide [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/ubuntu-17-10-a-last-minute-review/feed/</wfw:commentRss>
			<slash:comments>3</slash:comments>
		
		
			</item>
		<item>
		<title>Datumbox Machine Learning Framework v0.8.1 released</title>
		<link>https://blog.datumbox.com/datumbox-machine-learning-framework-v0-8-1-released/</link>
					<comments>https://blog.datumbox.com/datumbox-machine-learning-framework-v0-8-1-released/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Thu, 31 Aug 2017 21:28:09 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=812</guid>

					<description><![CDATA[The Datumbox v0.8.1 has been released! Download it now from Github or Maven Central Repository. What is new? The main focus of version 0.8.1 is to resolve various bugs, update the depedencies and improve the code architecture of the framework. Here are the details: Dependencies: Updated the Maven Compiler, Nexus Staging, Surefire, SLF4J and Logback [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/datumbox-machine-learning-framework-v0-8-1-released/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
		<item>
		<title>Drilling into Spark’s ALS Recommendation algorithm</title>
		<link>https://blog.datumbox.com/drilling-into-sparks-als-recommendation-algorithm/</link>
					<comments>https://blog.datumbox.com/drilling-into-sparks-als-recommendation-algorithm/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sat, 25 Feb 2017 14:15:55 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=788</guid>

					<description><![CDATA[The ALS algorithm introduced by Hu et al., is a very popular technique used in Recommender System problems, especially when we have implicit datasets (for example clicks, likes etc). It can handle large volumes of data reasonably well and we can find many good implementations in various Machine Learning frameworks. Spark includes the algorithm in [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/drilling-into-sparks-als-recommendation-algorithm/feed/</wfw:commentRss>
			<slash:comments>3</slash:comments>
		
		
			</item>
		<item>
		<title>Getting the GPU usage of NVIDIA cards with the Linux dstat tool</title>
		<link>https://blog.datumbox.com/getting-the-gpu-usage-of-nvidia-cards-with-the-linux-dstat-tool/</link>
					<comments>https://blog.datumbox.com/getting-the-gpu-usage-of-nvidia-cards-with-the-linux-dstat-tool/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sat, 11 Feb 2017 23:21:10 +0000</pubDate>
				<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=773</guid>

					<description><![CDATA[The dstat is an awesome little tool which allows you to get resource statistics for your Linux box. It has a modular architecture which allows you to develop additional plugins and it’s easy to use. Recently I was profiling a Deep Learning pipeline developed with Keras and Tensorflow and I needed detailed statistics about the [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/getting-the-gpu-usage-of-nvidia-cards-with-the-linux-dstat-tool/feed/</wfw:commentRss>
			<slash:comments>6</slash:comments>
		
		
			</item>
		<item>
		<title>Datumbox Machine Learning Framework version 0.8.0 released</title>
		<link>https://blog.datumbox.com/datumbox-machine-learning-framework-version-0-8-0-released/</link>
					<comments>https://blog.datumbox.com/datumbox-machine-learning-framework-version-0-8-0-released/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sun, 15 Jan 2017 01:45:53 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=760</guid>

					<description><![CDATA[Datumbox Framework v0.8.0 is out and packs several powerful features! This version brings new Preprocessing, Feature Selection and Model Selection algorithms, new powerful Storage Engines that give better control on how the Models and the Dataframes are saved/loaded, several pre-trained Machine Learning models and lots of memory &#38; speed improvements. Download it now from Github [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/datumbox-machine-learning-framework-version-0-8-0-released/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
			</item>
		<item>
		<title>Datumbox Machine Learning Framework 0.7.0 Released</title>
		<link>https://blog.datumbox.com/datumbox-machine-learning-framework-0-7-0-released/</link>
					<comments>https://blog.datumbox.com/datumbox-machine-learning-framework-0-7-0-released/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sat, 19 Mar 2016 12:17:12 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=744</guid>

					<description><![CDATA[I am really excited to announce that, after several months of development, the new version of Datumbox is out! The 0.7.0 version brings multi-threading support, fast disk-based training for datasets that don&#8217;t fit in memory, several algorithmic enhancements and better architecture. Download it now from Github or Maven Central Repository. What is new? The focus [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/datumbox-machine-learning-framework-0-7-0-released/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
			</item>
		<item>
		<title>Datumbox Machine Learning Framework 0.6.1 Released</title>
		<link>https://blog.datumbox.com/datumbox-machine-learning-framework-0-6-1-released/</link>
					<comments>https://blog.datumbox.com/datumbox-machine-learning-framework-0-6-1-released/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sat, 02 Jan 2016 16:54:21 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=734</guid>

					<description><![CDATA[The new version of Datumbox Machine Learning Framework has been released! Download it now from Github or Maven Central Repository. What is new? The main focus of version 0.6.1 is to resolve various bugs, reduce memory consumption and improve speed. Let&#8217;s see in detail the changes of this version: Bug Fixes: A minor issue related [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/datumbox-machine-learning-framework-0-6-1-released/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Datumbox Machine Learning Framework 0.6.0 Released</title>
		<link>https://blog.datumbox.com/datumbox-machine-learning-framework-0-6-0-released/</link>
					<comments>https://blog.datumbox.com/datumbox-machine-learning-framework-0-6-0-released/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 04 May 2015 10:04:07 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=704</guid>

					<description><![CDATA[The new version of Datumbox Machine Learning Framework has been released! Download it now from Github or Maven Central Repository. What is new? The main focus of version 0.6.0 is to extend the Framework to handle Large Data, improve the code architecture and the public APIs, simplify data parsing, enhance the documentation and move to [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/datumbox-machine-learning-framework-0-6-0-released/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>How to install and use the Datumbox Machine Learning Framework</title>
		<link>https://blog.datumbox.com/how-to-install-and-use-the-datumbox-machine-learning-framework/</link>
					<comments>https://blog.datumbox.com/how-to-install-and-use-the-datumbox-machine-learning-framework/#comments</comments>
		
		<dc:creator><![CDATA[Eleftherios Bampaletakis]]></dc:creator>
		<pubDate>Sun, 09 Nov 2014 18:32:56 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=640</guid>

					<description><![CDATA[In this guide we are going to discuss how to install and use the Datumbox Machine Learning framework in your Java projects. Since almost all of the code is written in Java, using it is as simple as including it as dependency in your Java project. Nevertheless a couple of classes (DataEnvelopmentAnalysis and LPSolver) use [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/how-to-install-and-use-the-datumbox-machine-learning-framework/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
		<item>
		<title>New open-source Machine Learning Framework written in  Java</title>
		<link>https://blog.datumbox.com/new-open-source-machine-learning-framework-written-in-java/</link>
					<comments>https://blog.datumbox.com/new-open-source-machine-learning-framework-written-in-java/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Sun, 19 Oct 2014 10:50:22 +0000</pubDate>
				<category><![CDATA[Framework]]></category>
		<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=620</guid>

					<description><![CDATA[I am happy to announce that the Datumbox Machine Learning Framework is now open sourced under GPL 3.0 and you can download its code from Github! What is this Framework? The Datumbox Machine Learning Framework is an open-source framework written in Java which enables the rapid development of Machine Learning models and Statistical applications. It [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/new-open-source-machine-learning-framework-written-in-java/feed/</wfw:commentRss>
			<slash:comments>5</slash:comments>
		
		
			</item>
		<item>
		<title>Clustering with Dirichlet Process Mixture Model in Java</title>
		<link>https://blog.datumbox.com/clustering-with-dirichlet-process-mixture-model-in-java/</link>
					<comments>https://blog.datumbox.com/clustering-with-dirichlet-process-mixture-model-in-java/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 07 Jul 2014 09:36:57 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=603</guid>

					<description><![CDATA[In the previous articles we discussed in detail the Dirichlet Process Mixture Models and how they can be used in cluster analysis. In this article we will present a Java implementation of two different DPMM models: the Dirichlet Multivariate Normal Mixture Model which can be used to cluster Gaussian data and the Dirichlet-Multinomial Mixture Model [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/clustering-with-dirichlet-process-mixture-model-in-java/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
			</item>
		<item>
		<title>Clustering documents and gaussian data with Dirichlet Process Mixture Models</title>
		<link>https://blog.datumbox.com/clustering-documents-and-gaussian-data-with-dirichlet-process-mixture-models/</link>
					<comments>https://blog.datumbox.com/clustering-documents-and-gaussian-data-with-dirichlet-process-mixture-models/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 30 Jun 2014 09:31:39 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=588</guid>

					<description><![CDATA[This article is the fifth part of the tutorial on Clustering with DPMM. In the previous posts we covered in detail the theoretical background of the method and we described its mathematical representationsmu and ways to construct it. In this post we will try to link the theory with the practice by introducing two models [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/clustering-documents-and-gaussian-data-with-dirichlet-process-mixture-models/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>The Dirichlet Process Mixture Model</title>
		<link>https://blog.datumbox.com/the-dirichlet-process-mixture-model/</link>
					<comments>https://blog.datumbox.com/the-dirichlet-process-mixture-model/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 23 Jun 2014 08:24:21 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=568</guid>

					<description><![CDATA[This blog post is the fourth part of the series on Clustering with Dirichlet Process Mixture Models. In previous articles we discussed the Finite Dirichlet Mixture Models and we took the limit of their model for infinite k clusters which led us to the introduction of Dirichlet Processes. As we saw, our target is to [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/the-dirichlet-process-mixture-model/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
		<item>
		<title>The Dirichlet Process the Chinese Restaurant Process and other representations</title>
		<link>https://blog.datumbox.com/the-dirichlet-process-the-chinese-restaurant-process-and-other-representations/</link>
					<comments>https://blog.datumbox.com/the-dirichlet-process-the-chinese-restaurant-process-and-other-representations/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Tue, 20 May 2014 11:26:28 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=543</guid>

					<description><![CDATA[This article is the third part of the series on Clustering with Dirichlet Process Mixture Models. The previous time we defined the Finite Mixture Model based on Dirichlet Distribution and we posed questions on how we can make this particular model infinite. We briefly discussed the idea of taking the limit of the model when [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/the-dirichlet-process-the-chinese-restaurant-process-and-other-representations/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Finite Mixture Model based on Dirichlet Distribution</title>
		<link>https://blog.datumbox.com/finite-mixture-model-based-on-dirichlet-distribution/</link>
					<comments>https://blog.datumbox.com/finite-mixture-model-based-on-dirichlet-distribution/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 12 May 2014 10:07:09 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=526</guid>

					<description><![CDATA[This blog post is the second part of an article series on Dirichlet Process mixture models. In the previous article we had an overview of several Cluster Analysis techniques and we discussed some of the problems/limitations that rise by using them. Moreover we briefly presented the Dirichlet Process Mixture Models, we talked about why they [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/finite-mixture-model-based-on-dirichlet-distribution/feed/</wfw:commentRss>
			<slash:comments>4</slash:comments>
		
		
			</item>
		<item>
		<title>Overview of Cluster Analysis and Dirichlet Process Mixture Models</title>
		<link>https://blog.datumbox.com/overview-of-cluster-analysis-and-dirichlet-process-mixture-models/</link>
					<comments>https://blog.datumbox.com/overview-of-cluster-analysis-and-dirichlet-process-mixture-models/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 05 May 2014 10:36:34 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=512</guid>

					<description><![CDATA[In the ISO research project for my MSc in Machine Learning at Imperial College London, I focused on the problem of Cluster Analysis by using Dirichlet Process Mixture Models. The DPMMs is a “fully-Bayesian” unsupervised learning technique which unlike other Cluster Analysis methods does not require us to predefine the total number of clusters within [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/overview-of-cluster-analysis-and-dirichlet-process-mixture-models/feed/</wfw:commentRss>
			<slash:comments>3</slash:comments>
		
		
			</item>
		<item>
		<title>Using Artificial Intelligence to solve the 2048 Game (JAVA code)</title>
		<link>https://blog.datumbox.com/using-artificial-intelligence-to-solve-the-2048-game-java-code/</link>
					<comments>https://blog.datumbox.com/using-artificial-intelligence-to-solve-the-2048-game-java-code/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 07 Apr 2014 08:57:55 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=476</guid>

					<description><![CDATA[By now most of you have heard/played the 2048 game by Gabriele Cirulli. It’s a simple but highly addictive board game which requires you to combine the numbers of the cells in order to reach the number 2048. As expected the difficulty of the game increases as more cells are filled with high values. Personally [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/using-artificial-intelligence-to-solve-the-2048-game-java-code/feed/</wfw:commentRss>
			<slash:comments>6</slash:comments>
		
		
			</item>
		<item>
		<title>Measuring the Social Media Popularity of Pages with DEA in JAVA</title>
		<link>https://blog.datumbox.com/measuring-the-social-media-popularity-of-pages-with-dea-in-java/</link>
					<comments>https://blog.datumbox.com/measuring-the-social-media-popularity-of-pages-with-dea-in-java/#respond</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 03 Mar 2014 10:36:57 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Online Marketing]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=458</guid>

					<description><![CDATA[In the previous article we have discussed about the Data Envelopment Analysis technique and we have seen how it can be used as an effective non-parametric ranking algorithm. In this blog post we will develop an implementation of Data Envelopment Analysis in JAVA and we will use it to evaluate the Social Media Popularity of [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/measuring-the-social-media-popularity-of-pages-with-dea-in-java/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Data Envelopment Analysis Tutorial</title>
		<link>https://blog.datumbox.com/data-envelopment-analysis-tutorial/</link>
					<comments>https://blog.datumbox.com/data-envelopment-analysis-tutorial/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 24 Feb 2014 10:52:54 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=433</guid>

					<description><![CDATA[Data Envelopment Analysis, also known as DEA, is a non-parametric method for performing frontier analysis. It uses linear programming to estimate the efficiency of multiple decision-making units and it is commonly used in production, management and economics. The technique was first proposed by Charnes, Cooper and Rhodes in 1978 and since then it became a [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/data-envelopment-analysis-tutorial/feed/</wfw:commentRss>
			<slash:comments>3</slash:comments>
		
		
			</item>
		<item>
		<title>How to build your own Facebook Sentiment Analysis Tool</title>
		<link>https://blog.datumbox.com/how-to-build-your-own-facebook-sentiment-analysis-tool/</link>
					<comments>https://blog.datumbox.com/how-to-build-your-own-facebook-sentiment-analysis-tool/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 03 Feb 2014 10:02:30 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Online Marketing]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=410</guid>

					<description><![CDATA[In this article we will discuss how you can build easily a simple Facebook Sentiment Analysis tool capable of classifying public posts (both from users and from pages) as positive, negative and neutral. We are going to use Facebook’s Graph API Search and the Datumbox API 1.0v. Similar to the Twitter Sentiment Analysis tool that [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/how-to-build-your-own-facebook-sentiment-analysis-tool/feed/</wfw:commentRss>
			<slash:comments>7</slash:comments>
		
		
			</item>
		<item>
		<title>Developing a Naive Bayes Text Classifier in JAVA</title>
		<link>https://blog.datumbox.com/developing-a-naive-bayes-text-classifier-in-java/</link>
					<comments>https://blog.datumbox.com/developing-a-naive-bayes-text-classifier-in-java/#comments</comments>
		
		<dc:creator><![CDATA[Vasilis Vryniotis]]></dc:creator>
		<pubDate>Mon, 27 Jan 2014 09:41:34 +0000</pubDate>
				<category><![CDATA[Machine Learning & Statistics]]></category>
		<category><![CDATA[Programming]]></category>
		<guid isPermaLink="false">http://blog.datumbox.com/?p=385</guid>

					<description><![CDATA[In previous articles we have discussed the theoretical background of Naive Bayes Text Classifier and the importance of using Feature Selection techniques in Text Classification. In this article, we are going to put everything together and build a simple implementation of the Naive Bayes text classification algorithm in JAVA. The code of the classifier is [&#8230;]]]></description>
		
					<wfw:commentRss>https://blog.datumbox.com/developing-a-naive-bayes-text-classifier-in-java/feed/</wfw:commentRss>
			<slash:comments>16</slash:comments>
		
		
			</item>
	</channel>
</rss>