• submit to reddit
Rafał Kuć01/30/14
6153 views
0 replies

Apache Solr and Lucene 4.6.1

Today Apache Lucene and Solr PMC announced another version of Apache Lucene library and Apache Solr search server numbred 4.6.1. This is a minor bugfix release. However, it is a recommended update for those using SolrCloud.

Zac Gery01/30/14
6373 views
0 replies

Why Am I Developing This?!?

Let's set the scene: The project has been humming along, and a new request comes in for a minor change. It's completed in quick order, QA signs off, but the customers are frustrated and confused. To find clarity, stop focusing on what did happen and dig a little deeper into what didn't happen.

Alec Noller01/29/14
7713 views
0 replies

Dev of the Week: Alex Curylo

Every week here and in our newsletter, we feature a new developer/blogger from the DZone community to catch up and find out what he or she is working on now and what's coming next. This week we're talking to Alex Curylo, long-time Mac expert and contract iPhone programmer in Vancouver, BC, Canada.

Wojciech Kudla01/29/14
3744 views
0 replies

How to Serialize an Array of Doubles to a Byte

Low latency systems require high performance message processing and passing. The technique presented in this article is a good example of how leveraging some aspects of the data benefits encoding in both latency and space complexity.

Tim Eisenhauer01/29/14
2814 views
0 replies

Problems with Free and Open Source Collaboration Platforms

Recently, one of my government clients decided to switch their public facing ASP.NET to Drupal (an open source solution), and they had an embarrassing hacking scenario. There are a number of disadvantages to open source systems that you should be aware of before making the switch.

Vlad Mihalcea01/28/14
7724 views
0 replies

MongoDB and the Fine Art of Data Modelling

When you first use MongoDB, you’ll immediately notice its schema-less data model. But schema-less doesn’t mean skipping proper data modelling. As opposed to a SQL database, a NoSQL document model is more focused on querying than data normalization. That’s why your design won’t be finished unless it addresses your data querying patterns.

Andreas Kollegger01/27/14
4357 views
0 replies

Neo4j 2.0: Importing Data the Spreadsheet Way!

It's a new year, and in December, Neo4j launched its shiny new version: the 2.0.0 release. So, in this article you'll find an update to Rik Van Bruggen's previous tutorial on how to import data into Neo4j using spreadsheets, this one written for Neo4j 2.0.0.

Alec Noller01/27/14
3117 views
0 replies

Quickly Removing Duplicates from MongoDB

If you've acquired some duplicates in MongoDB that you want to get rid of, this recent post provides a how-to on cleaning them up. The best option, obviously, is not to duplicate things in the first place - you're welcome - but the post is focused on solving the problem after the fact.

Ayende Rahien01/27/14
5212 views
0 replies

Big Data Search, Part 5: Sorting Optimizations

The author mentioned several times that the entire point of his previous exercise was to just see how it works, not to actually do anything production-worthy. But it is interesting to see what could be done better here.

Alec Noller01/26/14
6518 views
0 replies

The Best of the Week (Jan 17): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone! This week's best include a beginner's guide to MongoDB performance turbocharging, working with Node.js and N1QL, a look at an extremely configurable diff tool, and more.

Mark Needham01/24/14
6208 views
0 replies

Neo4j Backup: Store Copy and Consistency Check

One of the lesser-known things about the Neo4j online backup tool is that conceptually there are two parts to it: Copying the store files to a location of your choice, and verifying that those store files are consistent. Both of these run when you run the ‘neo4j-backup’ script, but it can be useful to run them separately.

Ayende Rahien01/24/14
8597 views
0 replies

Big Data Search, Part 4: The Index Format is Horrible

The author has finished his own Big Data Search exercise, and it's interesting to see just how far out there the code is. It isn't something that you can use for much except as a basis to see how badly you are doing. Let's start with the index format: A CSV file with the value and the position in the original file.

Alec Noller01/24/14
6465 views
0 replies

Julia: A High-Performance Programming Language for Data Science and More

Data scientists working with R and Python, as well as anybody looking for interesting, new-ish, high-performance programming languages should look into the not-as-much-discussed Julia. This recent blog posts discusses the merits and uses of Julia.

Evert Pot01/23/14
19196 views
1 replies

Google Code is Dead

Last year Google announced they would no longer provide a 'downloads' feature on google code hosting for new projects. Starting January 15th 2014, creating new downloads would also no longer be possible for existing projects.

Jalpesh Vadgama01/23/14
12409 views
0 replies

Why Continuous Integration is Your Friend

In this post we are going to learn the benefits of Continuous Integration in Software Development. Let’s understand what continuous integration is first and then we will discuss the benefits we are having.