00 - Data Center
Related page | Link |
---|---|
High Level Architecture | 00 - High Level System Design |
Key Concept: Process Area to Composite Application Mapping
The "Composite Applications" (e.g. Summit Data Management, External Data Management, etc.) are the headings under which the software supporting Data Center processes are described from a system perspective.
Key Concept: Data Stores
Data is central to the processes and supporting software for the Data Center. The data stores of the Data Center are depicted below along with more detail about how they are factored, how they relate to each other, and the technologies which support them.Â
Data Center Process Rules
Rule | Justification |
---|---|
Data Center user passwords are required to be "strong." | This is part of the security plan. |
Raw search requests require approval. | Raw data is of questionable external use. Requestor must be deemed able to use the data in order to justify the use of DC resources to assemble the raw data for distribution. More rules about escalation TBD. |
Email addresses must be verified for the account to be used. | We need to know that we can contact a user. |
For a user to be an Authorized Agent, they must have the approval of the DKIST Director. The authorization shall remain in effect until notification from Data Center Scientist. | Operations activities may require the review of data (including proprietary data) to handle calibration and instrument performance issues. These issues would be handled by non-Data Center personnel, and therefore would be required to use the end-user facing functions to retrieve data. |
DKIST Authorized Agents shall not use proprietary data for science. | It is the DKIST Authorized Agent's responsibility to know whether the data they are using for science is proprietary, and if it is, to not use it. |
proprietary data shall only be distributed externally to users when the following criteria is met: - the embargo is currently "active" - the user is an investigator or co-investigator on the proposal associated with the data - the user is an Authorized Agent of the DKIST | Embargo statuses support PhD students in keeping their data private for a period of time (typically 12 months) so as to not allow interference with their theses. |
No science data younger than 6 months can be removed. | This is a requirement. |
Processed Frames shall be proprietary based upon the proposal associated with a Processing Candidate. Artisanal (i.e., no candidate affiliation) Input Datasets are exempt from proposal embargo rules. | Input Datasets indicate the proposal for the observation's operational run. This is either inherited from the calibration association set's primary operational run or set by the calibration support team when constructing an Artisanal Input Dataset. |
Before the removal or modification of failed science ingest data, the instrument scientist must be consulted. | Science data will potentially be discarded, so a scientist must sign off on the action. |
Data ingested into the Data Center from non-Data Center sources must be provided through an authorized channel. | This is part of the security plan. |
Processed data shall be made discoverable to all users, both registered and unregistered (this excludes proprietary data, as its distribution has been defined differently in a previous rule). | This is part of the data policy. |
Science Data distributed from the Data Center to external destinations must transit an authorized channel. | This is part of the security plan. |
Data Center Process Roles and Responsibilities
Role | Responsibility | Strategy Area | Applicable Process |
---|---|---|---|
Calibration Support | Evaluate, and Approve or Disapprove Raw Search requests | Ops Support | Service Desk Management - Approval |
Calibration Support | Cancel Recipe Run | Science Data Processing | Process Scheduling |
Calibration Support | Create a dataset review ticket for superseded datasets | Science Data Processing | Reprocessing |
Calibration Support | Create Input Datasets comprised of input frames, calibration parameter values, and primary proposal ID | Science Data Processing | Input Data Assembly |
Calibration Support | Create Recipe Instances and Recipe Runs for reprocessing | Science Data Processing | Reprocessing |
Calibration Support | Create Recipe instances combining applicable Recipes and Input Datasets | Science Data Processing | Process Scheduling - Readiness Evaluation |
Calibration Support | Create Recipe runs based upon Recipe Instances for scheduling | Science Data Processing | Process Scheduling - Readiness Evaluation |
Calibration Support | Create Recipe Specifications | Science Data Processing | Manage Recipes - Create |
Calibration Support | Create work tickets for Calibration Associations that have no applicable Recipe | Science Data Processing | Process Scheduling - Readiness Evaluation |
Calibration Support | Determine Recipe applicability for Calibration Associations that are ready | Science Data Processing | Process Scheduling - Readiness Evaluation |
Calibration Support | Determine superseded datasets due to a planned reprocessing | Science Data Processing | Reprocessing |
Calibration Support | Execute manual Recipe Runs | Science Data Processing | Manual Processing |
Calibration Support | Identify Calibration Associations requiring Input Dataset assembly | Science Data Processing | Input Data Assembly |
Calibration Support | Pause and activate resource scheduling | Science Data Processing | Process Scheduling |
Calibration Support | Update Recipe Specifications | Science Data Processing | Manage Recipes - Update |
Calibration Support | Manage Science Data ingest failures | Summit Data Reception and Ingest | Ingest Failure Management |
Data Center | Send data in a check-pointed manner from a virtual folder to an authorized user endpoint | Distribution | User Download |
Data Center | Execute removal of proprietary proposals | External Data Ingest | Ingest Proprietary Proposals |
Data Center | Execute removal of proposal investigator links | External Data Ingest | Ingest Proposal Investigator Link |
Data Center | Ingest new Calibration Association sets | External Data Ingest | Ingest Data Acquisition Support Data |
Data Center | Ingest new Proprietary proposals | External Data Ingest | Ingest Proprietary Proposals |
Data Center | Ingest new parameter values for existing parameters | External Data Ingest | Ingest Parameter Values |
Data Center | Ingest new proposal investigator links | External Data Ingest | Ingest Proposal Investigator Link |
Data Center | Ingest updated Calibration Association sets | External Data Ingest | Ingest Data Acquisition Support Data |
Data Center | Raise a ticket if a Calibration Association set being updated is already in use | External Data Ingest | Ingest Data Acquisition Support Data |
Data Center | Assign Service Desk tickets to a group based upon ticket type | Ops Support | Service Desk Management - Assign |
Data Center | Audit and record inventory to object accuracy | Ops Support | Data Holding Audit - Inventory to Object |
Data Center | Audit and record object integrity | Ops Support | Data Holding Audit - Object Integrity |
Data Center | Audit and record object to inventory accuracy | Ops Support | Data Holding Audit - Object to Inventory |
Data Center | Generate aggregate events prior to removal of transactional events | Ops Support | Monitoring |
Data Center | Generate alerts based upon event review rules | Ops Support | Monitoring |
Data Center | Notify investigators when newly processed data is available based upon user preference (i.e., if their digest setting is on or off) | Ops Support | Processed Data Notification |
Data Center | Provide password reset instructions to valid email address requests | Ops Support | User Registration - Forgotten password |
Data Center | Record email verification | Ops Support | User Registration - Forgotten password |
Data Center | Regenerate lost Inventory Cards | Ops Support | Data Holding Audit - Object to Inventory |
Data Center | Remove transactional monitoring events greater than 1 year old if space is needed | Ops Support | Monitoring |
Data Center | Request email verification | Ops Support | User Registration - Forgotten password |
Data Center | Retrieve monitoring info from other DC systems | Ops Support | Monitoring |
Data Center | Route Service Desk tickets for approval | Ops Support | Service Desk Management - Create |
Data Center | Create work tickets for Recipe Runs identified as "manual" | Science Data Processing | Process Scheduling |
Data Center | Execute automated Recipe Runs | Science Data Processing | Automated Processing |
Data Center | Flag Calibration Associations as "ready" when all data has been received, or when 13 days from acquisition have passed, whichever comes first | Science Data Processing | Process Scheduling - Readiness Evaluation |
Data Center | Generate a Dataset Inventory record containing aggregate metadata from the associated frames | Science Data Processing | Science Data Ingest |
Data Center | Generate a Frame Inventory record containing header values | Science Data Processing | Science Data Ingest |
Data Center | Incorporate process management header values into the ingested Frame | Science Data Processing | Science Data Ingest |
Data Center | Ingest Processed Data Frames into the Object Store | Science Data Processing | Science Data Ingest |
Data Center | Schedule resources for Recipe Runs identified as "manual" | Science Data Processing | Process Scheduling |
Data Center | Create Frame Inventory records for science data that have been ingested | Summit Data Reception and Ingest | Science Data Ingest |
Data Center | Ingest Science Data | Summit Data Reception and Ingest | Science Data Ingest |
Data Center | Receive data from the Summit | Summit Data Reception and Ingest | Data Center Receipt |
Data Center | Record expected Frame counts by Observing Program Run ID | Summit Data Reception and Ingest | Transfer Manifest Ingest |
Data Center | Record receipt count of science data ingested by Observing Program Run ID | Summit Data Reception and Ingest | Science Data Ingest |
Data Center | Retain Ancillary Data for at least 90 days | Summit Data Reception and Ingest | Ancillary Data Ingest |
Data Center | Route data from the Summit to the appropriate ingest process | Summit Data Reception and Ingest | Data Center Receipt |
Data Center Operations | Create and review system event reports | Ops Support | Monitoring |
Data Center Operations | Create event alert rules | Ops Support | Monitoring |
Data Center Operations | Create Help Tickets | Ops Support | Service Desk Management - Create |
Data Center Operations | Ensure the embargo (proprietary) (proprietary) Update Rule is followed | Ops Support | User Authorization Management |
Data Center Operations | Activate/deactivate user accounts | Ops Support | User Registration - Inactivate/Activate Users |
Data Center Operations | Maintain service desk ticket status to reflect work status | Ops Support | Service Desk Management - Do Work |
Data Center Operations | Notify requestors of ticket completion for help tickets | Ops Support | Service Desk Management - Do Work |
Data Center Operations | Perform data deletion at the direction of DQAC data removal tickets | Ops Support | DQAC Reduction |
Data Center Operations | Perform Raw Data searches as directed in approved Raw Data search tickets | Ops Support | Raw Search |
Data Center Operations | Resolve open service desk tickets | Ops Support | Service Desk Management - Do Work |
Data Center Operations | Review open service desk tickets | Ops Support | Service Desk Management |
Data Center Operations | Review storage health monitoring information | Ops Support | Monitoring |
Data Center Operations | Route system authorizations to System Admin | Ops Support | User Authorization Management |
Data Center Operations | Stage Raw Data requested in Raw Data search tickets | Ops Support | Raw Search |
Data Center Operations | Update embargo (proprietary) Â Agent Authorizations | Ops Support | User Authorization Management |
Data Center Operations | Update user account information | Ops Support | User Registration - Update |
Data Center Operations | Manage Frame Manifest Ingest Failures | Summit Data Reception and Ingest | Ingest Failure Management |
Data Center Operations | Manage Summit Ingest categorization failures | Summit Data Reception and Ingest | Ingest Failure Management |
DC Project Manager | Coordinate change management issues for Summit Ingest schema discrepancies | Summit Data Reception and Ingest | Ingest Failure Management |
Registered User | Initiate transfer of data from an authorized data center endpoint | Distribution | User Download |
Registered User | Retrieve documentation from dataset links | Ops Support | Code and Algorithm Document Distribution |
Registered User | Search and retrieve code and tools | Ops Support | Code and Algorithm Document Distribution |
Registered User | Search and retrieve documentation | Ops Support | Code and Algorithm Document Distribution |
Registered User | Change password | Ops Support | User Registration - Update |
Registered User | Create Help Ticket | Ops Support | Service Desk Management - Create |
Registered User | Reset password | Ops Support | User Registration - Forgotten password |
Registered User | Update user account information | Ops Support | User Registration - Update |
Registered User | Create requests for Raw Data | Search | Search |
Registered User | Request distribution | Search | Request Distribution |
Registered User | Search for processed data | Search | Search |
Registered User | Select datasets for download | Search | Review Results |
Science | Create DQAC reduction tickets for datasets that need to be removed due to reprocessing | Science Data Processing | Reprocessing |
Science | Review new/superseded datasets to determine data removal action(s) | Science Data Processing | Reprocessing |
System Admin | Create, update and remove system level authorizations for DC users | Ops Support | User Authorization Management |
Unregistered User | Search and retrieve code and tools | Ops Support | Code and Algorithm |
Unregistered User | Search and retrieve documentation | Ops Support | Code and Algorithm |
Unregistered User | Create Help Ticket | Ops Support | Service Desk Management - Create |
Unregistered User | Enter info to create an account | Ops Support | User Registration - Create |
Unregistered User | Verify email address provided | Ops Support | User Registration - Create |
Unregistered User | Create requests for Raw Data | Search | Search |
Unregistered User | Search for Processed data | Search | Search |
Unregistered User | Select datasets for download | Search | Review Results |
Authorized Agent | Determine embargo (proprietary) status of datasets in search results | Search | Review Results |
DKIST Director | Approve Authorized Agent status permissions | Ops Support | User Authorization Management |
Data Center Scientist | Trigger the removal of Authorized Agent permissions | Ops Support | User Authorization Management |