Big Data and NoSQ...
Follow
Find
20.8K views | +0 today
Scoop.it!

Apache Spark for Big Analytics

Apache Spark for Big Analytics | Big Data and NoSQL Daily | Scoop.it
by Thomas Dinsmore, Director of Product Management at Revolution Analytics The emergence of Apache Spark is a key development for Big Analytics in 2013.
Simon Hunanyan's insight:

Spark, an Apache incubator project, is an open source distributed computing framework for advanced analytics in Hadoop. It's 100X faster than what they are able to achieve with MapReduce. Spark includes a machine learning library (MLLib), a graph engine (GraphX), a streaming analytics engine (Spark Streaming) and much more...

Currently, Spark supports programming interfaces for Scala, Java and Python.  The R interface is under development and this is expected to be released in the first half of 2014.

more...
No comment yet.
Big Data and NoSQL Daily
Daily compilation of articles about NoSQL, HBase, Hadoop, MongoDB and other Big Data tools and trends
Curated by Monitis
Your new post is loading...
Scoop.it!

Teradata says Hadoop is good for business -- but for how long?

Teradata says Hadoop is good for business -- but for how long? | Big Data and NoSQL Daily | Scoop.it
Teradata announced a new set of features and products on Monday that should improve its position as a go-to analytics vendor even in an age of Hadoop. But as open sources technologies evolve, Teradata might face a challenge to attract new users.
more...
No comment yet.
Scoop.it!

MapReduce and R: Short example on regression / forecasting

Going over how to address a use case where there is a need to forecast future performance based on historical trend. This short presen(...)
more...
No comment yet.
Scoop.it!

Self-Service Creation of a Hadoop Cluster using vCAC (VMware vCloud Automation Center) - YouTube

This demonstration will show how the combination of vSphere Big Data Extensions (BDE) and vCloud Automation Center (vCAC) can provide a service catalog that ...
more...
No comment yet.
Scoop.it!

Enterprise Hadoop and the Journey to a Data Lake - Hortonworks

Enterprise Hadoop and the Journey to a Data Lake - Hortonworks | Big Data and NoSQL Daily | Scoop.it
Apache Hadoop didn't disrupt the data center, the data did. In this post we explore Hadoop as part of an integrated, modern data architecture.
more...
No comment yet.
Scoop.it!

When to Use MongoDB Rather than MySQL (or Other RDBMS): The Billing Example | Javalobby

When to Use MongoDB Rather than MySQL (or Other RDBMS): The Billing Example | Javalobby | Big Data and NoSQL Daily | Scoop.it
NoSQL is a hot buzz in the air for a pretty long time (well, it's not only a buzz anymore).

However, when should we really use it?
Best Practices for...
Simon Hunanyan's insight:

Once again, the best practices and tips..

more...
Scoop.it!

To Hadoop, Or Not to Hadoop? That is the Question - Datanami

To Hadoop, Or Not to Hadoop? That is the Question - Datanami | Big Data and NoSQL Daily | Scoop.it
Many businesses are actively researching and planning on implementing a Hadoop solution. Hadoop vendors are also beginning to offer their services in the cloud.
Simon Hunanyan's insight:

In short, author is considering about when is reasonable to use Hadoop...

more...
Webdevilopers's curator insight, March 25, 6:53 AM
Don’t use Hadoop:For collecting and analyzing structured data. Being that Hadoop is designed for processing vast stores of accumulated data, using Hadoop for storing and analyzing data that trickles in at a steady and predictable rate over time would be overkill.
Scoop.it!

Best Practices for Big Data Integration

Everybody has heard about Big Data and it certainly sounds enticing. However, as always, it is a good idea to separate fact from fiction and to know what eff...
Simon Hunanyan's insight:

BI Industry expert Claudia examines the best practices for Big Data integration as the foundation for a successful BI environment.

more...
No comment yet.
Scoop.it!

What is a good way to design a NoSQL database?

What is a good way to design a NoSQL database? | Big Data and NoSQL Daily | Scoop.it
Josep Lluis Larriba Pey's answer: As Chris Shrader says in his answer, you first have to know how to use a NoSQL DB. In this case, from my experience, I can tell you that Graph NoSQL databases are a special case.
more...
No comment yet.
Scoop.it!

How-to: Index and Search Multilingual Documents in Hadoop ...

Cloudera Search brings full-text, interactive search, and scalable indexing to Apache Hadoop by marrying SolrCloud with HDFS and Apache HBase, and other projects in CDH. Because it's integrated with CDH, Cloudera ...
more...
No comment yet.
Scoop.it!

Redis Everywhere - Sunshine PHP

REDIS EVERYWHERE (RT @alsargent: Great overview of when to use #Redis -- and when not to http://t.co/SPTVSRSSdx #NoSQL)
more...
No comment yet.
Scoop.it!

Big Data Platform Comparisons: 3 Key Points - InformationWeek

Big Data Platform Comparisons: 3 Key Points - InformationWeek | Big Data and NoSQL Daily | Scoop.it
ReadWrite
Big Data Platform Comparisons: 3 Key Points
InformationWeek
Our recent 16 Top Big Data Analytics Platforms collection has generated lots of interest and plenty of comments and questions.
more...
No comment yet.
Scoop.it!

MongoDB and MySQL – Comparing Scalability, Data Distribution ...

MongoDB and MySQL – Comparing Scalability, Data Distribution ... | Big Data and NoSQL Daily | Scoop.it
Note: This blog post part 3 of the series and you can download 30-day trial of ScaleBase to practice the concepts. In this article comparing MongoDB and MySQL scalability, I want to focus on query models.
more...
No comment yet.
Scoop.it!

CloudFront: Fun with HBase shell

CloudFront: Fun with HBase shell | Big Data and NoSQL Daily | Scoop.it

HBase shell is great, specially while getting yourself familiar with HBase. It provides lots of useful shell commands using which you can perform trivial tasks like creating tables, putting some test data into it, scanning the whole table, fetching data from a specific row etc etc. 

Sergeyan's insight:

Discusses  HBase shell relatively less known operations.

more...
No comment yet.
Scoop.it!

NoSQL with MySQL

NoSQL with MySQL | Big Data and NoSQL Daily | Scoop.it
Oracle added NoSQL capabilities to the InnoDB engine in MySQL 5.6, providing a 9x improvement in transaction performance. Here's how to use the NoSQL features.
more...
No comment yet.
Scoop.it!

Selecting the right SQL-on-Hadoop engine to access big data

Selecting the right SQL-on-Hadoop engine to access big data | Big Data and NoSQL Daily | Scoop.it
SQL-on-Hadoop engines are available from a variety of vendors. But expert Rick van der Lans cautions that while they appear similar on the surface, there are important differences.
more...
No comment yet.
Scoop.it!

MariaDB adds NoSQL features to relational database roots

MariaDB adds NoSQL features to relational database roots | Big Data and NoSQL Daily | Scoop.it
MariaDB 10 is out, featuring a “Connect engine” that makes it easier to handle data from both traditional SQL databases and more web-scale NoSQL systems. The new functionality merits new editions of the MariaDB Enterprise and Enterprise Cluster products.
more...
No comment yet.
Scoop.it!

Big Data - The 5 Vs Everyone Must Know

This slide deck, by Big Data guru Bernard Marr, outlines the 5 Vs of big data. It describes in simple language what big data is, in terms of Volume, Velocity...
more...
No comment yet.
Scoop.it!

Apache Hadoop 2.3.0 Released! - Hortonworks

Apache Hadoop 2.3.0 Released! - Hortonworks | Big Data and NoSQL Daily | Scoop.it
Announcing the release of Hadoop 2.3.0 with 560 JIRAs fixed.
Simon Hunanyan's insight:

With this release, there are two significant enhancements to HDFS:

  • Support for Heterogeneous Storage Hierarchy in HDFS
  • In-memory Cache for data resident in HDFS via Datanodes 

Besides, there are a lot of bug fixes and small changes ...

more...
No comment yet.
Scoop.it!

Cassandra Vs HBase : Which NoSql store do I need ?

"There many NoSql databases out there and it can be confusing to determine which one is suitable for a particular use case. In this blog, we discuss the two more popular ones, Cassandra and HBase..."

Simon Hunanyan's insight:

The author's summarize is the following:

  • If you have data warehousing type of use cases and large amounts the data that will continue to grow, HBase is a more suitable choice.
  • Cassandra, in generally,  is positioning itself as an alternative to your typical RDBMS.
more...
No comment yet.
Scoop.it!

HBase: The Definitive Guide by Lars George - EbookNetworking.net

HBase: The Definitive Guide by Lars George - EbookNetworking.net | Big Data and NoSQL Daily | Scoop.it
HBase: The Definitive Guide, a book by Lars George (RT @kmap2: HBase: The Definitive Guide/Lars George #books http://t.co/SrPbZBcP8N)
more...
No comment yet.
Scoop.it!

[repost ]Cassandra documentation from DataStax

[repost ]Cassandra documentation from DataStax | Big Data and NoSQL Daily | Scoop.it
original:http://wiki.apache.org/cassandra/GettingStarted DataStax's latest Cassandra documentation covers topics from installation to troubleshooting, including a Quick Start Guide. Documentation for older releases is also available.
more...
No comment yet.
Scoop.it!

5 things everyone should know about Hadoop - GigaOM

5 things everyone should know about Hadoop - GigaOM | Big Data and NoSQL Daily | Scoop.it
5 things everyone should know about Hadoop
GigaOM
It didn't take long for the Hadoop market to become a juggernaut, and it won't take long for it to undergo some significant technological changes.
more...
Webdevilopers's curator insight, March 25, 7:05 AM
Hadoop is coming to the mid-market
Scoop.it!

Apache Hadoop - MapReduce (MR2) and YARN

http://zerotoprotraining.com This video explains the concepts of MapReduce (MR2) and YARN related to Apache Hadoop.
more...
No comment yet.
Scoop.it!

HBase 0.98.0 is Released - Hortonworks

HBase 0.98.0 is Released - Hortonworks | Big Data and NoSQL Daily | Scoop.it
The release of HBase 0.98.0 saw the resolution of 230 JIRA tickets for a lot of new features.
more...
No comment yet.
Scoop.it!

Crunching 30 Years of NBA Data with MongoDB Aggregation

Crunching 30 Years of NBA Data with MongoDB Aggregation | Big Data and NoSQL Daily | Scoop.it
When you are looking to run analytics on large and complex data sets, you might instinctively reach for Hadoop. However, if your data’s in MongoDB, using the Hadoop connector seems like overkill if...
more...
No comment yet.