Can anyone share how you handle data archiving, especially when moving to the cloud? Our organization has never archived data before. I’m interested in learning about your approach, how you got business teams to classify their data and set retention periods, and how you managed risks like storage costs.
What's critical with archiving isn't the place where it happens; it's all about data classification and tagging. You need to classify the data in at least the following categories:
1. Confidentiality (0–3)
2. Integrity (0 or 1), where 1 means the originator of the data must be stored along with the data
3. Data retention time, i.e. how long the data needs to be archived, or, the other way round, the point in time at which the data must be deleted.
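As a rough sketch of how these three categories could travel with the data, here is a small helper that builds an S3-style tag set at ingestion time and derives the deletion date from the retention period. The tag keys and the function itself are illustrative choices, not a standard:

```python
from datetime import date, timedelta

def classification_tags(confidentiality: int, integrity: int,
                        retention_days: int, created: date) -> dict:
    """Build an S3-style tag set for the three classification axes.

    Tag keys ("confidentiality", "integrity", "delete-after") are
    illustrative names, not a standard.
    """
    if not 0 <= confidentiality <= 3:
        raise ValueError("confidentiality must be 0-3")
    if integrity not in (0, 1):
        raise ValueError("integrity must be 0 or 1")
    # Retention expressed "the other way round": the point in time
    # at which the data must be deleted.
    delete_after = created + timedelta(days=retention_days)
    return {
        "confidentiality": str(confidentiality),
        "integrity": str(integrity),
        "delete-after": delete_after.isoformat(),
    }
```

Tagging at ingestion keeps the classification attached to the object itself, so a later lifecycle or deletion job only needs to read the tags, not consult the business teams again.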
You can largely neglect storage cost: archives have no performance requirements, so you can choose a cheap S3 storage class (e.g. S3 Glacier), and archive to 2 different locations for availability.
It's also advisable to place an object lock on the archive so that deletion is impossible until the retention time is over.
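A minimal sketch of such a lock with boto3 (bucket and key names are placeholders; the bucket must have Object Lock enabled when it is created). COMPLIANCE mode means no user, including root, can delete or overwrite the object version before the retention date:

```python
from datetime import datetime, timedelta, timezone

def lock_params(bucket: str, key: str, retention_days: int) -> dict:
    """Request parameters for an S3 put_object call with a
    compliance-mode object lock on the written object."""
    retain_until = datetime.now(timezone.utc) + timedelta(days=retention_days)
    return {
        "Bucket": bucket,
        "Key": key,
        "ObjectLockMode": "COMPLIANCE",
        "ObjectLockRetainUntilDate": retain_until,
    }

# Example call (requires AWS credentials and an Object Lock-enabled
# bucket; names below are hypothetical):
# import boto3
# s3 = boto3.client("s3")
# s3.put_object(Body=b"archived record",
#               **lock_params("my-archive-bucket",
#                             "records/2024/rec-001.json",
#                             365 * 7))
```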
At a minimum, do it whenever there is a new implementation project... that's the only time it seems possible to get business users' attention and bandwidth. It is assumed there is a contract term and the data required. Doing it as an ongoing process is complex & expensive, and fairly impossible unless you handle it at the time of creation or ingestion, or at the time of a new implementation. Not sure if your question is a process one or a platform one, though?