Andrew Brust, Author at Gigaom Your industry partner in emerging technology research Wed, 14 Oct 2020 00:31:08 +0000 en-US hourly 1 https://wordpress.org/?v=6.5.3 Outlook: Big data and analytics in 2015 https://gigaom.com/report/outlook-big-data-and-analytics-in-2015/ Tue, 23 Dec 2014 15:00:36 +0000 http://research.gigaom.com/?post_type=go-report&p=243243/ The year 2015 will be one in which data and analytics technologies become more standardized, more broadly adopted, and more successful.

The post Outlook: Big data and analytics in 2015 appeared first on Gigaom.

]]>
Big data and analytics technologies have been going gangbusters for several years, with companies, funding rounds, technologies, and releases occuring rapid-fire since Hadoop and NoSQL stepped onto the industry stage. Each of the last several years saw an overarching theme in the data arena: 2012 was the year big data became really hot; 2013 was the year it grew more accessible, through SQL-on-Hadoop; and 2014 was the year it became far-more versatile, with the addition of YARN and Spark. 2015 will be the year Hadoop matures.

This maturation process in the data world will involve developments that are less exciting than those we have seen over the last few years. But while 2015 won’t be a breakthrough year, it certainly will be a year of the much-needed building of credibility for newer data and analytics technologies. As such, it will be a year where these technologies become more standardized, more broadly adopted, and more successful. And for vendors in the space, it may be a more lucrative year than those which have preceded it.

Hadoop will become more:

  1. Usable by end-users as it transitions to being more embedded and less-directly exposed
  2. Enterprise-adoptable through such additions as better tooling and true role-based access controls (RBAC)
  3. Developer-friendly through the addition of tools and comprehensive APIs

Not coincidentally, the third point will help advance the first two.

NoSQL databases will mature as well. In fact, so will relational databases. Each will continue to acquire qualities of the other, with a likely outcome that the two categories will converge.

This report will examine the following likely trends affecting the analytics space in 2015:

  • Data governance will become a higher priority, and the conundrum of enforcing governance over ever-growing, real-time streaming data sets will have to be confronted. Current approaches to governance, based on years of data warehousing and OLAP work, won’t be sufficient to the task, and continuing to avoid the governance question won’t work either. New thinking will be required.
  • Hadoop will morph from an infrastructural entity that users must care for as they work with it to a running service that they can use in a task-oriented way. Discrete components, like Hive and Pig, will continue to be useful in their own right but will become constituent services that higher-level, cloud-based self-service platforms will leverage for their own functionality.
  • In much the same way that new user platforms will make Hadoop more self-service, so too will new developer platforms make Hadoop more developer-friendly and more embeddable in line-of-business applications. But proper evangelism and mentoring will be required to get enterprise developers over the big data “hump.”
  • Relational databases will become more adept at handling semi-structured data. Such capability will need to extend beyond support for XML and JSON as datatypes to accommodating schema-less tables alongside conventional ones, in the same databases and the same SQL queries. Likewise, today’s NoSQL databases may need to allow schema-based tables, albeit within the key-value, column family, document store, or graph storage formats that they use today.

Thunbmail image courtesy of nadia/iStock.

The post Outlook: Big data and analytics in 2015 appeared first on Gigaom.

]]>
Big data and analytics third quarter 2014: analysis and outlook https://gigaom.com/report/big-data-and-analytics-third-quarter-2014-analysis-and-outlook/ Tue, 07 Oct 2014 17:03:52 +0000 http://research.gigaom.com/?post_type=go-report&p=238537/ The third quarter showed clear signs of Hadoop’s maturation and the industry’s maturity in working with it. Meanwhile, megavendors continued to build out their big data and analytics portfolios.

The post Big data and analytics third quarter 2014: analysis and outlook appeared first on Gigaom.

]]>
The third quarter of 2014 brought forward much in the way of Hadoop’s maturation and the industry’s maturity in working with it. Whereas in the second quarter there were early signs of the rise of Hadoop 2.0’s YARN resource manager and the decline of MapReduce, this past quarter, that phenomenon was in full effect.

As open source data project momentum increases, the value add of commercial offerings that integrate with, abstract away, and simplify those underlying technologies has grown. Incumbent vendors are moving as quickly as they can to adjust to the changes. NewSQL and NoSQL vendors are carving out niches for themselves as they move beyond the market entry stage.

This report will examine the following areas of activity from the third quarter and their impact on the near term outlook in the analytics space:

  • The continued growth of YARN, combined with increased momentum for Spark, leads a banner quarter in new releases and shorter release cycles for numerous Apache Software Foundation projects.
  • Incumbent vendors made huge strides in the big data arena, with Teradata making acquisitions, Oracle announcing new technologies, new releases and new cloud services, and Microsoft coming to market with three important new releases of its own. Meanwhile, IBM has doubled-, nay tripled-down on Watson.
  • Several data discovery vendors, which Gigaom Research reviewed in a Sector Roadmap earlier this year, have come out with new releases and previews that range from evolutionary to innovative to game-changing.
  • Meanwhile, horizontal competitors in the NewSQL and NoSQL fields have started to go vertical, as standing out amongst the crowd starts to become a major imperative.

 

Thumbnail image courtesy: iStock/Thinkstock

The post Big data and analytics third quarter 2014: analysis and outlook appeared first on Gigaom.

]]>
Big data and analytics second-quarter 2014: analysis and outlook https://gigaom.com/report/big-data-and-analytics-second-quarter-2014-analysis-and-outlook/ Wed, 16 Jul 2014 12:00:16 +0000 http://research.gigaom.com/?post_type=go-report&p=233035/ In the second quarter of 2014, new de facto standards emerged and galvanized, major cloud providers launched new analytics offerings, and mainstream databases began to take on attributes and capabilities of heretofore separate special-purpose products.

The post Big data and analytics second-quarter 2014: analysis and outlook appeared first on Gigaom.

]]>
In the second quarter of 2014, new de facto standards emerged and galvanized, major cloud providers launched new analytics offerings, and mainstream databases began to take on attributes and capabilities of heretofore separate special-purpose products.

Thus is the paradox of the data analytics market right now. New techniques and technologies emerge, but things are nonetheless becoming more integrated. This extends to companies too as the venture capital dollars continue to flow in, but at the same time, a few acquisitions presage a consolidation for the industry.

This report will examine the following areas of activity this quarter and their impact on the near-term outlook in the analytics space:

  • Funding rounds were closed by all three major Hadoop vendors, and there were also smaller rounds by companies in the in-memory analytics and business intelligence spheres. Vendors are still going back to the well, and self-sufficiency is a ways off.
  • Several acquisitions, also spanning the field of Hadoop players and business intelligence competitors, took place this quarter. Consolidation is starting to take root.
  • Each of the three major public cloud providers have either shored up analytics offerings or announced entirely new ones. The cloud, as a data analytics platform, may be belatedly blooming, with customers’ trepidation subsiding.
  • Apache Spark and YARN, two open source projects that are at once different and yet competitive, reached critical mass in vendor support. The fundamental Hadoop stack is changing, and MapReduce’s importance is waning.
  • Streaming data processing and machine learning are now on-radar for the broader data analytics marketplace, making this one of the next frontiers for big data and a focus area for vendors.
  • Suddenly several transactional databases are integrating analytical capabilities in a quest to serve in mixed-workload capacities. The sprawl of data across multiple databases may be coming to an end.

The post Big data and analytics second-quarter 2014: analysis and outlook appeared first on Gigaom.

]]>
How to solve big data challenges in the ad-tech industry https://gigaom.com/report/how-to-solve-big-data-challenges-in-the-ad-tech-industry/ Thu, 03 Jul 2014 18:58:29 +0000 http://research.gigaom.com/?post_type=go-report&p=232582/ Advertising technology involves a formidable set of requirements, and a new category of database known as NewSQL is emerging to address multi-genre analytics.

The post How to solve big data challenges in the ad-tech industry appeared first on Gigaom.

]]>
Advertising technology (ad tech) involves a formidable set of requirements. Operationally, ad-tech tools must work at lightning speeds to offer and complete bids for ad inventory. Meanwhile both historical and real-time data must be analyzed to inform and optimize those operational actions. Conventional operational databases have difficulty scaling to meet these requirements. Analytics databases fall into different categories with their strengths and weaknesses, and they address few if any operational requirements. This can make building ad-tech systems difficult, to say the least.
But a new category of database known as NewSQL is emerging to address multi-genre analytics requirements. Some NewSQL databases even handle operational workloads as well. In this report we investigate ad tech’s requirements and how NewSQL databases address them, and we reveal how one ad-tech firm is using NewSQL databases to improve and simplify its architecture and infrastructure.
Key findings include:

  • Ad tech has exacting requirements that push the analytics envelope. It is heavily reliant on precision and real-time and predictive analytics, with high data volume workloads and demanding SLAs.
  • Scale-out architectures and in-memory technology featured in newer database products work well for ad-tech work and other analytics workloads with high-volume, high-velocity data.
  • Meanwhile sacrificing the relational data model, transactional consistency, and SQL compatibility of conventional databases brings risks, productivity hits, and staffing challenges.
  • NewSQL databases combine facets of different database types such that newer technologies and older standards coexist for maximum benefit.

The post How to solve big data challenges in the ad-tech industry appeared first on Gigaom.

]]>