[Hubs] Support for large datasets #57

Open
5 of 20 tasks
flanakin opened this issue Feb 18, 2023 · 3 comments
Assignees
Labels
Tool: FinOps hubs Data pipeline solution Type: Feature 💎 Idea to improve the product

Comments

@flanakin
Collaborator

flanakin commented Feb 18, 2023

📝 Scenario

As a FinOps practitioner, I need to ingest data into a queryable data store so that I can report on cost data at scale, beyond $5M/mo in spend.

💎 Solution

Support large datasets (e.g., 500 GB/mo) with up to 7 years of historical data that refreshes when changed. To do this, add an option to ingest data into Azure Data Explorer and update reporting to use that database.
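
To put those targets in perspective, here is a rough back-of-the-envelope sketch of the total volume they imply. It assumes a constant 500 GB/mo ingestion rate, no compression or deduplication, and decimal units (1 TB = 1000 GB). The function name is hypothetical and only for illustration; actual storage in any data store would differ.

```python
def retained_volume_gb(monthly_gb: float, years: int) -> float:
    """Raw data volume retained after `years` of ingestion at a constant monthly rate.

    Assumption: no compression, deduplication, or growth in monthly volume.
    """
    return monthly_gb * 12 * years


# Figures from the issue text: 500 GB/mo, up to 7 years of history.
total_gb = retained_volume_gb(500, 7)
print(f"{total_gb:,.0f} GB (~{total_gb / 1000:.0f} TB)")  # 42,000 GB (~42 TB)
```

So the upper end of the stated target is on the order of tens of terabytes of raw data.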

📋 Tasks

Required tasks

  1. Needs: Information Tool: FinOps hubs
    Springstone
  2. Tool: FinOps hubs
    Springstone
  3. Tool: FinOps hubs
    Springstone
  4. Skill: Documentation Tool: FinOps hubs
  5. Tool: FinOps hubs

Stretch goals

  1. Needs: Information Tool: FinOps hubs Tool: Power BI
  2. Needs: Information Tool: FinOps hubs
  3. Needs: Information Tool: FinOps hubs

ℹ️ Additional context

An internal analysis of the optimal data store for the largest datasets found Azure Data Explorer to be the best option, balancing cost, performance, and scale.

🙋‍♀️ Ask for the community

We could use your help:

  1. Please vote this issue up (👍) to prioritize it.
  2. Leave comments to help us solidify the vision.
@flanakin flanakin added the Type: Release 🚀 Tracks the progress of a release label Feb 18, 2023
@flanakin flanakin self-assigned this Feb 18, 2023
@flanakin flanakin added the Skill: Data factory Data Factory integration label Feb 18, 2023
@flanakin flanakin changed the title v0.2 – Support for large datasets Hubs v0.2 – Support for large datasets Apr 6, 2023
@flanakin flanakin added this to the 0.2 milestone Dec 4, 2023
@flanakin flanakin modified the milestones: 0.3, 0.4 Jan 23, 2024
@flanakin flanakin assigned Springstone and unassigned flanakin Jan 28, 2024
@flanakin flanakin changed the title Hubs v0.2 – Support for large datasets [Hubs] Support for large datasets Jan 28, 2024
@microsoft-github-policy-service microsoft-github-policy-service bot added the Tool: FinOps hubs Data pipeline solution label Jan 28, 2024
@flanakin
Collaborator Author

Closing this since we're tracking releases in a new way now and this is outdated.

@flanakin flanakin closed this as not planned Jan 28, 2024
@t-esslinger

t-esslinger commented Jan 30, 2024

Hello @flanakin, is this feature still in your backlog? We would be highly interested in being able to handle larger datasets more easily as well.

@flanakin flanakin added Type: Feature 💎 Idea to improve the product and removed Skill: Data factory Data Factory integration Type: Release 🚀 Tracks the progress of a release labels Jun 26, 2024
@flanakin
Collaborator Author

@t-esslinger Sorry for missing the comment. Yes, this is still in the backlog. We're making progress slowly. I'm reopening this issue to track everything needed.

@flanakin flanakin reopened this Jun 26, 2024
3 participants