The author wrote before about the benefit of employing optimistic locking for MongoDB batch processors. The optimistic locking exception is a recoverable one, as long as you fetch the latest Entity, update and save it. Spring makes it easy to implement an automatic retry mechanism, and this is how he did it.
The author's previous post on recommendation systems suffers from a lack of diversity. For example, a list may contain the same book as a soft cover, hard cover, and Kindle version. Because interests are diverse, a better recommendation list should contain items that cover a broad spectrum of the user's interests
This installment of Arthur Charpentier's regular collection of data science-related links includes analyzing baseball data with R, a profile of a sword-swallowing statistician, and the technology behind Twitter that creates a massive network of data.
We are doing some more performance work in Voron. And we got some really surprising results there. Voron is writing at really good rate, (better than anything else we tested against), just not a good enough rate.
Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone! This week's best include the open source announcement of Facebook's Presto, an analysis of the multi-armed bandit algorithm, and practical uses for Big Data in terms of social and economic efficiency.
Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include a look at what happened to all the buzz around NoSQL, a story about the perils of using MongoDB, and an objective measure of the most popular SQL and NoSQL database engines.
Using Packer to create CentOS and Ubuntu boxes is not difficult. If you want to follow along, I have prepared a Git repository ariya/packer-vagrant-linux which contains all the necessary bits to create CentOS 5.4 and/or Ubuntu 12.04 LTS 64-bit boxes.
Logentries processes over 10 billion log events every day. That’s quite a lot of data. So, the Logentries research team decided to take advantage of their unique position and set out to examine a sample of their overall user base for insights: 22 billion log events from over 6,000 Heroku applications.
The author was asked "how the whole 'Hacker News MongoDB random bashing' situation was dealt with from the inside." In this article, she explains her reaction and her strategies for handling such issues during her time at MongoDB.
Managers don’t want to think harder than they have to. They like simple rules of thumb. One of the most useful rules of thumb is the 80:20 rule. You can see obvious cases where the 80:20 rule applies in software without looking too hard. For example, 80% of performance improvements are found by optimizing 20% of the code.
Yesterday someone was looking at my code and said “uh, you’re using +T in a generic”, and I said “that’s a hipster feature of Scala, you don’t need to understand it, you just need to get it right so your code compiles.”
In my previous post I installed and configured Ghost and WordPress. The purpose of that blog post was to test relative performance of the 2 platforms to see which one could handle more load. Many requested another test where an opcode cache was in place for WordPress. So that is exactly what this blog post is about.
After finishing up the major change of moving Voron to a Write Ahead Journal, it was time to actually start doing some performance testing. To make things interesting, I decided that we shouldn’t just compare this in isolation, but we should actually compare it to its peers.
We started working on a CloudStack backend for rOCCI using our CloudStack Ruby gem. The choice of rOCCI was made due to the existence of an existing Opennebula backend and the adoption of OCCI in the European Grid Initiative Federated cloud testbed.