Wikibon Principal Research Contributor Jeff Kelly provides an inclusive basic tutorial of the big data environment, including technologies, skill sets, and use cases, in “Big Data: Hadoop, Business Analytics and Beyond”, and while the environment starts with Hadoop and Map Reduce, it extends far beyond that. While parts of this report may seem basic to technical readers, it provides an excellent overview of technologies, pros and cons, and market issues such as the lack of trained technical personnel and data scientists.
Showing posts with label HADOOP. Show all posts
Showing posts with label HADOOP. Show all posts
Wednesday, February 12, 2014
Tuesday, February 11, 2014
[forbes] Big Data: 5 Reasons Why Hadoop Is Ready for Enterprise Prime Time
As 2014 gets into full swing, Hadoop is increasingly being used for applications that are integral to daily business operations. No longer is Hadoop viewed by some organizations as just a platform for big data proof-of-concept applications. IT leaders should be developing a strategy for production-ready infrastructure, now, so they are ready to leverage the emerging technical advances that make Hadoop more capable of supporting business-critical big data applications.
[blackhat] BINARYPIG - SCALABLE MALWARE ANALYTICS IN HADOOP
PRESENTED BY
Over the past 2.5 years Endgame received 20M samples of malware equating to roughly 9.5 TB of binary data. In this, we’re not alone. McAfee reports that it currently receives roughly 100,000 malware samples per day and received roughly 10M samples in the last quarter of 2012 [1]. Its total corpus is estimated to be about 100M samples. VirusTotal receives between 300k and 600k unique files per day, and of those roughly one-third to half are positively identified as malware [2].
Subscribe to:
Posts (Atom)