Three principles of data management: standardization, centralization, and integration
These three principles build on one another to create efficiencies and consistencies within the organization that allow for easier and more timely access to information.
Standardization
Standardizing data and data creation and storage within an organization requires knowing the sources of
the data. Some data sources are consistent across all teams. E.g., all teams use video, keep box-score data, and have scouting reports.
Identifying, locating, and describing all the data sources establishes the organization’s data inventory. In constructing this inventory, organizational decision makers need to consider all functions within the organization.
Each has a unique set of data that it might create, store, and access in its own way, which can significantly slow down the decision-making process. The inventory should be used to create a standard set of definitions for the different kinds of data that the organization uses.
If the standard definitions are not used as the project begins then the project must be carefully reorganized when it becomes a significant data asset. Once these hurdles are overcome and data is handled in a standard manner across all functions, centralizing the data is possible
Centralization
Good data management reduces the time spent looking for the people that can give decision makers access to the information they need and provides a team with a significant competitive advantage.
When all data are centralized, personnel executives can spend more time evaluating and coaches can spend more time strategizing and coaching—providing them an edge over the competition.
Centralization ensures that all decision makers see the same data. When decision makers are getting data from different sources, it is often possible that they see different data even if they are looking at the same variables.
Having one set of consistent data for all decision makers to rely on is commonly referred to as having “one version of the truth.” This efficiency gives coaches more time to discuss and analyze which provides the team with a competitive advantage.
Centralization also allows higher-quality data. Errors in organizational data are a significant problem in general. No matter how sophisticated and thorough a decision-making process is, it will not be successful if the input (the data) is faulty.
When teams use more complex data sets, such as play-by-play data or even motion-capture data, identifying the errors in the data is even more a prerequisite for accurate analysis.
Once all organizational data are centralized, the problems associated with faulty data are reduced in two ways: only the best and most reliable sources of data are used, and consistent error-checking processes can be put in place.
Integration
The integration of data across functions within the organization allows for seamless access to every department’s data. Scouting and medical reports are linked to play-by-play data, which are linked to video
files, and the connections go on.
On its own, each type of data is valuable, but when integrated, there are synergies created among the different data sources that cannot occur when the data are segregated.
Presenting the data in this integrated fashion allows the decision maker to identify and explore the differences of opinion in a more efficient
manner.
Eg, if the analytic data paint a different picture of a player from the scouting reports, medical data may be able to explain the differences. If medical data do not explain, then video lets the GM see the player in action and decide for himself which information is most relevant
The three components of data management discussed here provide a basis for an efficient data-management system that will provide a competitive advantage by saving time for decision makers and creating a more complete picture of the team or player being analyzed.
All of the information is available when the decision maker is ready to begin, and it is less likely that a piece of the information will be missed because the right person was not available to produce to it in a timely manner.
• • •
Missing some Tweet in this thread? You can try to
force a refresh
A clear understanding of data and the various types of data is needed. The word “data,” particularly in the context of analytics, is often associated with quantitative data. Quantitative data, however, is just one type of data that is used on a daily basis by decision makers.
Along with quantified data such as box scores and draft-combine results, decision makers use a host of qualitative data. Qualitative data take a variety of
forms, including scouting reports, coach’s notes, and video.