Cloud Base Data Lake - Data Virtualization/Logical Data Lake

64%

Status

Executing [Implementation of the project]

64% complete, updated on Thu 9/7/23 3:49 PM by Lance Rivera

Changed Percent Complete from 63% to 64%.

Sep 7, 2023 | Cloud Data Lake Bi-Weekly Meeting

Notes

  • Discuss current status of Projects:

    • Compunnel

      • Champion:  Nandita Signh

      • Current Status:  

        • Phase 2 & 3 are occurring concurrently

        • Workshop with Azure Aug 29th (Craig, Sam, Nandita)

          • Planning build out 

          • TSTC Zero account will go away and move to another MS Domain 

            • MS and AZURE consolidation will be taken as we move forward with Compunnel

            • Who will perform move to do determined

      • Next Steps:  

        • Data Factory proof to Production

          • Azure is a prerequisite for this step

            • Access to MS Eco Syste

            • m to Azure

        • N/A -Setup structured and semi structured data with Azure 

        • Contract to include Phase 3 approved and executed

        • 4 weeks to completion after Azure access

        • No further access anticipated

          • Tableau access was added to another 2 today

          • Workday access was put on hold for now

      • Anticipated completion date: 10/01/2023

    • Data Science Lab

      • Champion: George Makiya

      • Current Status:  Was on hold; now back on track

      • Next Steps:

        • Rediscovery meeting for team - 

        • Tonic access 

        • Architecture MS to assist in designing structure

          • Storage tiers classified - possible 3 tiers

        • Add NSC - National Student Clearing 

        • Google Analytics Data

        • Craig to setup another BI SQL Dev - in another couple of weeks

          • Only copy NSC data to this environment

      • Anticipated completion date:  TBD

    • Denodo

      • Champion:  Bill Holifield

      • Current Status:  Open cases

        • SP-TSA-8559-23-47

          • Workday to Denodo Financials needs caching and help!

            • Solution Offered

 
  • SP-TSA-6210-23-47

    • Data Catalog Advisory Session with Inessa Gerber

      • Waiting for Customer Action

        • Bill will close this case

 
  • SP-TSA-8192-23-47

    • Monitoring Framework

      • Assigned - In Progress

      • Need a fix for the monitoring agent startup issues.

  • Next Steps:  Support mode; Solution is in operation

    • Monitoring framework hours

    • Consulting hours

  • Anticipated completion date:  

    • Implementation is complete?

      • Audit to be reviewed by Bill & Craig

        • 1 hour session - 9/8 @ 9 am

      • Catalog

      • Data Mesh Assessment

      • Monitoring

    • Learning & Support for 3 years

      • Purchased 160 hours

  • Informer 5

    • Champion: Michael LeRoux \ Tina Skidmore

    • Current Status:  Working with Regina (Entrinsik)

    • Next Steps: 

      • Login issue to bring in Entrinsik

        • Tina will enter ticket with Entrinsik

        • All in ver 5 test

      • Vendor access for Entrinsik

        • Lance to contact Regina

          • Access has already been granted 

      • Develop the Access Process between DnA and OIT

        • Development side of data sets 

        • Roles & permissions

          • Security Matrix

        • What does roll out look like

        • Who is responsible for what roles

      • Creating a Governance 

      • Migration to production

      • Phased approach

        • Student Enrollment

        • HR and Finance

    • Anticipated completion date:  Early Fall semester ‘23

  • Microsoft Azure

    • Champion: Nandita Signh

    • Current Status:  See Compunnel

    • Next Steps:

      • SHI conversation 

        • Will bill month to month, Only SHI can see the billing info as they are the middle man

      • Should there be a Blanket PO or pay with a pcard

        • May need to work with Gladia on billing

    • Anticipated completion date:  

  • Power BI Licenses

    • Currently 2 Tiers

      • Pro (25)

        • Consumers

      • Premium (30)

        • DnA Team

        • Executives

    • Power BI Production

      • Gateway connections

      • Access

    • Anticipated completion date:  09/21/23

    • Setup session on how to build connections for gateway

Action items

  • Lance to contact Regina about credentials on Friday, 09/08/23
  • Bill and Craig to meet about the Denodo audit on Friday, 09/08/23
  • Craig to setup session on how to build connections for gateway
  • Janine/Lance - check with Carrie and the workday project for EIV and General to see if it impacts anything we are doing here.


 

Details

Dates
Mon 7/18/22 - Tue 4/30/24
Acct/Dept
Office of Information Technology
Type
Solutions Project Management / Default
Health
Green - On track
Created
Mon 7/18/22 8:52 AM
Modified
Thu 9/7/23 3:49 PM

Project Request Form

Requestor
This is an individual that is listed as the person asking for a project evaluation, may not necessarily be the Project Champion.
George Makiya
Project Champion Supervisor
Project Champion's direct Supervisor
Jonathan Hoekstra
Vice Chancellor
Project Champion's Vice Chancellor for the division
Jonathan Hoekstra
Requested Delivery Date
Enter an estimated date you would like this project completed/delivered
11/30/2022
Do you have an existing solution you are wanting to replace or enhance?
If so, provide some background information on the solution / process you are currently using.
No
What are your goals and objectives this solution will accomplish for your department?
Please provide your goals and objectives this solution will accomplish for your department.
This project will ensure expert configuration and maintenance of the ETL and the Data Warehouse.
List your primary requirements for this solution.
Please list your primary requirements for this solution.
Prospective vendors should clearly demonstrate their capabilities for provision of the aforementioned services. The following are the minimum requirements for the ETL and DW support. There’s concern over the ETL support function. Also, with the lack of data quality checks in the earlier process stages, there are questions about the quality of data being loaded into the Warehouse. This service will ensure the smooth running, configuration and tuning of the environment on an ongoing basis. The remote Database support will ensure optimal performance as well as backups, restores and archival.
Do you have funding for this project? If so, please provide the amount.
Please list Budgeted Amount?
Yes - Guaranteed funding
Please list any subject matter experts who can assist with this project.
Who has the knowledge in your area or other areas that can assist?
George Makiya
Please list any other pertinent information that is relevant to this project request.
Please list any other pertinent information that is relevant to this project request.
In the absence of reliable data quality and data hygiene, the college is unable to optimize its analytic and data driven decision support capabilities. Unreliable data inadvertently leads to poor quality decision making. The college in its current state is incapable of quickly pivoting to take advantage of new opportunities. The lack of agility and trust in the data has limited the leadership’s ability to accurately assess risk and/or conduct necessary analyses with confidence whenever looking to enter new markets, launch new programs or introduce new services.

This project will ensure expert configuration and maintenance of the ETL and the Data Warehouse. These form the bedrock of the data flows to all decision support systems. Expert tuning of the ETL will ensure that it runs efficiently and accurately, limiting errors in the extraction of data and loading into the warehouse. The warehouse maintenance will ensure that the correct schemas and configurations are in place to facilitate rapid modeling of the data. It will also ensure that data are consistent and dependable.

Description

TSTC is, as part of its digital transformation efforts, migrating its data and reporting environment to a cloud-based Data Lake. The legacy systems of record such as Colleague as well as several others will however remain on premise. As part of this cloud transition, some of the on-premises database infrastructure tools and services will be upgraded and/or stabilized prior to full migration.

The current Database (DB) and Business Data Intelligence tools portfolio remains largely inefficient and/or poorly configured. Tools such as the Extract, Transform and Load (ETL) require reconfiguration and fine tuning. The SQL Server database and warehouse require higher skilled expertise to tackle the chronic problems of data hygiene, quality, and traceability/lineage.

The college recognizes a deficiency in capability to efficiently manage and optimize value from the existing ETL and Data Warehouse product suite. This project seeks to engage expert level resources to provide skills augmentation for provision of top-notch ongoing support for the ETL, and the Data Warehouse (DW).

Goals

  1. Collaboratively work with IT and or other divisions to implement new services or redesign current solutions to meet our business line evolving needs.

Manager

Alternate Manager(s)

Stakeholders (7)

BH
Bill Holifield
Data Engineer
Responsible, Accountable
Senior Manager - BI Infrastructure
Fri 6/2/23 3:58 PM
GM
George Makiya
Executive Vice President Data & Analytics
Responsible, Accountable, Consulted, Informed
Exec VP, Data & Analytics
Mon 7/18/22 2:16 PM
GP
Gustavo Perez
Data Analyst
Responsible, Accountable
Data Warehouse Analyst I
Fri 6/2/23 3:58 PM
JH
Jonathan Hoekstra
Vice Chancellor/Cfo
Consulted, Informed
VC & Chief Finance Officer
Mon 7/18/22 2:16 PM
LT
Luan Tran
Applied Research Analyst
Responsible, Accountable
Applied Research Analyst
Fri 6/2/23 3:57 PM
MJ
Madelynne Johnston
Exec Asst to Chancellor
Consulted, Informed
Chief of Staff II- Finance
Mon 7/18/22 2:16 PM
TS
Tina Skidmore
Senior Executive Director
Responsible, Accountable
Director - BI Center of Excellence
Fri 6/2/23 3:59 PM