Alec Noller 01/29/14
8924 views
0 replies

Dev of the Week: Alex Curylo

Every week here and in our newsletter, we feature a new developer/blogger from the DZone community to catch up and find out what he or she is working on now and what's coming next. This week we're talking to Alex Curylo, long-time Mac expert and contract iPhone programmer in Vancouver, BC, Canada.

Wojciech Kudla 01/29/14
5648 views
0 replies

How to Serialize an Array of Doubles to a Byte

Low-latency systems require high-performance message processing and passing. The technique presented in this article is a good example of how exploiting known properties of the data can improve encoding in both latency and space.

Tim Eisenhauer 01/29/14
3762 views
0 replies

Problems with Free and Open Source Collaboration Platforms

Recently, one of my government clients decided to switch their public-facing ASP.NET site to Drupal (an open source solution), and they had an embarrassing hacking scenario. There are a number of disadvantages to open source systems that you should be aware of before making the switch.

Vlad Mihalcea 01/28/14
9134 views
0 replies

MongoDB and the Fine Art of Data Modelling

When you first use MongoDB, you’ll immediately notice its schema-less data model. But schema-less doesn’t mean skipping proper data modelling. As opposed to a SQL database, a NoSQL document model is more focused on querying than data normalization. That’s why your design won’t be finished unless it addresses your data querying patterns.

Andreas Kollegger 01/27/14
5506 views
0 replies

Neo4j 2.0: Importing Data the Spreadsheet Way!

It's a new year, and in December, Neo4j launched its shiny new version: the 2.0.0 release. So, in this article you'll find an update to Rik Van Bruggen's previous tutorial on how to import data into Neo4j using spreadsheets, this one written for Neo4j 2.0.0.

Alec Noller 01/27/14
4717 views
0 replies

Quickly Removing Duplicates from MongoDB

If you've acquired some duplicates in MongoDB that you want to get rid of, this recent post provides a how-to on cleaning them up. The best option, obviously, is not to duplicate things in the first place - you're welcome - but the post is focused on solving the problem after the fact.

Ayende Rahien 01/27/14
5923 views
0 replies

Big Data Search, Part 5: Sorting Optimizations

The author mentioned several times that the entire point of his previous exercise was to just see how it works, not to actually do anything production-worthy. But it is interesting to see what could be done better here.

Alec Noller 01/26/14
7059 views
0 replies

The Best of the Week (Jan 17): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone! This week's best include a beginner's guide to MongoDB performance turbocharging, working with Node.js and N1QL, a look at an extremely configurable diff tool, and more.

Mark Needham 01/24/14
7588 views
0 replies

Neo4j Backup: Store Copy and Consistency Check

One of the lesser-known things about the Neo4j online backup tool is that conceptually there are two parts to it: Copying the store files to a location of your choice, and verifying that those store files are consistent. Both of these run when you run the ‘neo4j-backup’ script, but it can be useful to run them separately.

Ayende Rahien 01/24/14
9753 views
0 replies

Big Data Search, Part 4: The Index Format is Horrible

The author has finished his own Big Data Search exercise, and it's interesting to see just how far out there the code is. It isn't something that you can use for much except as a basis to see how badly you are doing. Let's start with the index format: A CSV file with the value and the position in the original file.
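To make the index format in this teaser concrete, here is a minimal sketch of a CSV index that pairs each value with its position in the original file. It is not the author's code: the function names `build_index` and `write_index_csv`, the use of byte offsets as the "position", one value per line, and sorting the entries by value are all assumptions made for illustration.

```python
import csv
import io


def build_index(data: bytes) -> list[tuple[str, int]]:
    """Return (value, byte offset) pairs, one per line of the input.

    Assumption: each line of the original file holds one value, and the
    index is sorted by value so it could later be searched.
    """
    entries = []
    offset = 0
    for line in data.splitlines(keepends=True):
        value = line.rstrip(b"\r\n").decode("utf-8")
        entries.append((value, offset))
        offset += len(line)  # advance by the raw line length, newline included
    entries.sort()
    return entries


def write_index_csv(entries: list[tuple[str, int]]) -> str:
    """Serialize the index as CSV text with one 'value,position' row per entry."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerows(entries)
    return buf.getvalue()


# Hypothetical sample data standing in for the original file.
sample = b"pear\napple\nfig\n"
index_csv = write_index_csv(build_index(sample))
print(index_csv)
```

A text format like this is easy to inspect, but every lookup has to parse rows and numbers, which hints at why the teaser calls the format horrible.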

Alec Noller 01/24/14
7929 views
0 replies

Julia: A High-Performance Programming Language for Data Science and More

Data scientists working with R and Python, as well as anybody looking for interesting, new-ish, high-performance programming languages, should look into the less-discussed Julia. This recent blog post discusses the merits and uses of Julia.

Evert Pot 01/23/14
20822 views
1 reply

Google Code is Dead

Last year Google announced they would no longer provide a 'downloads' feature on Google Code hosting for new projects. Starting January 15th, 2014, creating new downloads would also no longer be possible for existing projects.

Jalpesh Vadgama 01/23/14
14188 views
0 replies

Why Continuous Integration is Your Friend

In this post we are going to learn the benefits of Continuous Integration in software development. Let’s first understand what continuous integration is, and then we will discuss the benefits it brings.

Hardik Pandya 01/23/14
18608 views
0 replies

How to Set Up a Multi-Node Hadoop Cluster on Amazon EC2, Part 1

After spending some time playing around on a single-node pseudo-distributed cluster, it's time to get into real-world Hadoop. It's important to note that there are multiple ways to achieve this, and the author is going to cover how to set up a multi-node Hadoop cluster on Amazon EC2.

Mark Needham 01/23/14
4863 views
0 replies

Neo4j Backup: Dealing with java.lang.ClassCastException

When using Neo4j’s online backup facility there are two ways of triggering java.lang.ClassCastException, either by using the ‘single://’ or ‘ha://’ syntax, and these behave slightly differently. In this article, you'll learn how to handle these exceptions.