Microsoft Azure Cosmos DB is Microsoft's Big Data analysis platform. It is a NoSQL database service and is a replacement for the earlier DocumentDB NoSQL database.
N/A
Pricing
Apache Cassandra
Azure Cosmos DB
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
Cassandra
Azure Cosmos DB
Free Trial
No
No
Free/Freemium Version
No
No
Premium Consulting/Integration Services
No
No
Entry-level Setup Fee
No setup fee
No setup fee
Additional Details
—
—
More Pricing Information
Community Pulse
Apache Cassandra
Azure Cosmos DB
Features
Apache Cassandra
Azure Cosmos DB
NoSQL Databases
Comparison of NoSQL Databases features of Product A and Product B
Cassandra excels in a broad range of applications -- especially if you understand its data model and write your applications accordingly. It's an excellent choice for time-series data, and a poor choice for application queues. It performs the best if you can simply record history and compute from it, rather than going back and editing or deleting things a lot.
NoSQL platforms are very useful when it comes to security, speed, accuracy, high accessibility with high read and write power. Everything is managed under the cloud and we have the various capabilities of Azure and support for Microsoft products with us. Flexibility in price and variety of features, as well as real-time results, are some of the popular [features] of this platform.
High Availability - we utilize the data replication features of Cassandra. This enables us to access our data even when several nodes have gone down
Data Locality - our architecture combines Cassandra storage nodes and computation nodes in the same machine. This enables us to utilize data locality and limit expensive network IO to read data.
Elasticity - Cassandra is a shared nothing architecture. Nodes can be added very easily and they discover the network topology. As soon as a node has joined the Cassandra ring, the data is redistributed among the existing nodes and streamed to it automatically.
Turn-key geo-redundancy with multi-master writes is unprecedented and unparalleled in the industry!
Guaranteed low latency makes Cosmos DB an excellent fit for most of our performance-intensive situations.
The tunable consistency model simplifies so many challenges in distributed systems engineering that otherwise require advanced knowledge of computer science topics. I continue to be impressed at how Cosmos DB has abstracted away so much complexity.
No Ad-Hoc Queries: Cassandra data storage layer is basically a key-value storage system. This means that you must "model" your data around the queries you want to surface, rather than around the structure of the data itself.
There are no aggregations queries available in Cassandra.
I would recommend Cassandra DB to those who know their use case very well, as well as know how they are going to store and retrieve data. If you need a guarantee in data storage and retrieval, and a DB that can be linearly grown by adding nodes across availability zones and regions, then this is the database you should choose.
It's efficient, easy to scale, and works. We do have to do a bit of administration, but less now than when we started with this a couple of years ago. Microsoft continues to improve its self-management capability.
Like I said, Cosmos is the way to go. From all of the services that Azure has, Cosmos is very robust in terms of usability. It's ever-evolving and integrates with other applications seamlessly. The interface is pretty easy to understand. I implemented various solutions for my company and Cosmos was one of them.
The support team is very responsive and we are generally satisfied with Microsoft support, in my opinion support team of a product and service is just as valuable as its quality and performance. Telephone answering, 24-hour hotline, email support and ticketing are excellent.
Apache Cassandra has the best of both worlds, it is a Java based NoSQL, linearly scalable, best in class tunable performance across different workloads, fault tolerant, distributed, masterless, time series database. We have used both Apache HBase and MongoDB for some use cases which were within hadoop setup and JSON (JavaScript Object Notation) document store respectively, but given the overall factors favoring Apache Cassandra, it is a technology choice for multiple platforms!
The open source version of Cassandra is only suggested for learning the basic concepts and play with its core features. Unless you really want to invest a lot in your developers and architects knowing every detail of Cassandra, I prefer the DataStax enterprise version. Although the license cost is relatively high, I think they it is worth it. I'm thinking about the support, the monitoring tool OpsCenter, and the integration of Solr and Spark (for data analysis).
Cassandra didn't fully replace our old and traditional relation database Oracle. In addition, it opens another door for us to deal with some special business use cases that NoSQL database can do better in a more feasible and efficient way.