Showing posts with label HADOOP. Show all posts
Showing posts with label HADOOP. Show all posts

Wednesday, February 12, 2014

[siliconangle] Big data infrastructure goes far beyond Hadoop #BigDataSV

Wikibon Principal Research Contributor Jeff Kelly provides an inclusive basic tutorial of the big data environment, including technologies, skill sets, and use cases, in “Big Data: Hadoop, Business Analytics and Beyond”, and while the environment starts with Hadoop and Map Reduce, it extends far beyond that. While parts of this report may seem basic to technical readers, it provides an excellent overview of technologies, pros and cons, and market issues such as the lack of trained technical personnel and data scientists.

Tuesday, February 11, 2014

[forbes] Big Data: 5 Reasons Why Hadoop Is Ready for Enterprise Prime Time

As 2014 gets into full swing, Hadoop is increasingly being used for applications that are integral to daily business operations. No longer is Hadoop viewed by some organizations as just a platform for big data proof-of-concept applications. IT leaders should be developing a strategy for production-ready infrastructure, now, so they are ready to leverage the emerging technical advances that make Hadoop more capable of supporting business-critical big data applications.

[blackhat] BINARYPIG - SCALABLE MALWARE ANALYTICS IN HADOOP

PRESENTED BY

Over the past 2.5 years Endgame received 20M samples of malware equating to roughly 9.5 TB of binary data. In this, we’re not alone. McAfee reports that it currently receives roughly 100,000 malware samples per day and received roughly 10M samples in the last quarter of 2012 [1]. Its total corpus is estimated to be about 100M samples. VirusTotal receives between 300k and 600k unique files per day, and of those roughly one-third to half are positively identified as malware [2].