20 Aug 2012
I have spoken with about 40 IT leaders this year about NoSQL technologies. In each case I was tasked with assessing fit, providing education or bringing in a technology appropriately, or we simply traded thoughts about NoSQL. The usual first reaction is one of concern or possibly indifference.
NoSQL has thoroughly penetrated the new era of internet, online marketing and social media companies. Although we may be at the “tip of the iceberg” of the penetration even in those companies, the venture capital in the NoSQL community demands that it have full and complete penetration of the Global 2000 and beyond. While there are obviously many current projects in those companies, I find that there are...
31 Jul 2012
As information projects charge forward to keep up with user demand, information security looms larger than ever. While Development still wants to push forward unabated, projects now face more scrutiny than ever to identify security risks and address them. Often an afterthought in the past, security concerns are serious project requirements today.
If you are leading development of an information management project, there are no excuses for not incorporating security requirements into project planning. And yes, that includes big data projects! Most of the time, in large organizations, these requirements will come from the security organization. However,...
24 Jul 2012
Every two weeks or so on a project operating with SCRUM, the team will go through a variety of exercises. One is a retrospective on the sprint that is ending. We want to know what went right, what went wrong and challenge our guiding principles for the project. Another exercise is planning the next sprint. As project owner, I will have already done extensive prework for this, reviewing and adding to the backlog and putting the backlog in priority order.
One of the self-organizing aspects of SCRUM is the story point estimation. We want to promise to the business in the next sprint a prioritized set of stories up to an overall estimated story point number that is based on our performance-to-...
17 Jul 2012
The 2012 Pacific Northwest Business Intelligence Summit has ended. This is the eleventh year of this annual event in a cabin setting in Grants Pass, Oregon along the Rogue River. I have been aboard for the last 10 years, having missed only the very first one, which apparently was the one to miss given the downpour that occurred. I was invited to the inaugural one, but was skeptical of the format. The Summit survived to the second, and now the eleventh, year and to say I’ve gotten over my skepticism would be an understatement.
I look forward to this event each year. It’s a midyear boost of information, ideas and relationships that is one of my strongest data points. I think...
11 Jul 2012
There are a lot of people still struggling with the question of what the big data movement means to them and to their company. They know they have or want as much data as they can get their hands on to better understand their business, products, and customers. But what many people are still struggling with is how they can turn that data into actionable information. With the sheer amount of data we are considering these days, it is easy to get lost in the weeds.
Last month I went to the Cloud Expo/Big Data Expo conference in New York City and had the opportunity to speak with lots of people about what Jaspersoft offers around Big Data. What I found was that in most cases,...
11 Jul 2012
Advocates of NoSQL solutions have positioned them as supporting that class of data we’ve come to know as “big data.” Big data is data at a scale of volume, velocity and variety where NoSQL projects provide price-performance benefits over traditional platforms for the storage of the data. This positioning pigeonholes NoSQL in both good and bad ways.
On the one hand, big data has become quite important and is playing a larger role in enterprise information strategies. Though still comprising a relatively small portion of most Global 2000 information technology budgets, it is a fast-growing category that demands attention. It is certainly getting attention from the investment...
11 Jun 2012
When I think of Communities, and especially those related to Open Source Software, I think of a few key benefits:
The ability for community members to learn from the shared experiences of other members
More relevant product development through community based product roadmap direction and validation
Enhanced quality through the variety of use cases that are exercised throughout the community and the feedback that comes from it
Enhanced customer support through the availability of real-time interactions with the community of people and companies using the product
The top benefit of large communities, in my mind, is the ability to learn from the actual...
30 May 2012
Data Matching, Parsing and Deduplication, hereafter sometimes referred to as Data Matching, are some of the most challenging data quality rules to enforce. Data Matching rules are also some of the toughest to get consensus on and undoubtedly the most labor intensive.
The context for the re-emergence of data matching on my radar is master data management projects. These projects need to create golden records for wide distribution. Hence, there is little room for error and matching rules must be sound. However, many data warehouse projects are doing matching exercises before the data is made available – because, in those environments, the data warehouse has the golden records. And...
21 May 2012
For those who suddenly find themselves on a working team where the leader is getting excited about “agile”, “SCRUM” or some related word, take heed. Perhaps a class has been instrumental in this excitement or perhaps it’s a mandate (or implied mandate) and now it’s time for the business intelligence team to take it up. Either way, here come the changes.
I’ve brought this change to our teams and clients, hit some bumps along the way and gained some perspective I’ll share here. I won’t focus on what agile/SCRUM is for business intelligence (see http://www.jaspersoft.com/blog-entry/common-sense-and-our-approach-to-information-management...
1 May 2012
Much of our attention these days is focused on where to put workloads across the legacy structures as well as the newer opportunities of Hadoop, streaming, master data management, columnar databases and columnar options, data warehouse appliances, etc. However, there is an aspect of all large data store configurations that can make or break any such strategy and must be focused on to ensure any enterprise information strategy is sound.
This element is the allocation and juxtaposition of storage across hard disk drives (HDD) and solid state drives (SSD), the accompanying allocation of memory and the associated partitioning strategy (hot/cold) within the data. Throw in the compression...