You are here

Content migration burden: it's not just automated or manual

Website Migration Handbook
· ·

How much burden will each team face?

The content migration process is a bit more subtle than is obvious on the surface. Notably, it isn't just a decision of whether to automate or not, but how much burden the different teams will face. Sure, maybe a fifty page site should be migrated entirely manually, but for anything much larger there will be a blend of automation and manual intervention. Often the technical and non-technical teams don't really understand each other (or the migration process), so the technical team just says it will automate what it can and then hand it to the less technical teams to clean up. The technical team might do what amounts to an automated copy and paste with a twist of trivial cleanup using a tool like HTMLtidy. That's a bit like just kicking a bunch of rocks down the mountain for the people living below to deal with.

Problems with the simple two-step automation approach

Some problems with this technical-team-hands-off-to-other-teams approach:

  • Unplanned cost for nontechnical teams. By shifting the burden to the nontechnical teams (also typically known as The Client), these teams then have more work than they were probably anticipating. This approach gives no chance for effective estimation (see Why estimate? I'm not getting more resources for this site migration..
  • Lost opportunities. Often a lot more can be automated than is initially discussed (see Content Migration: What Can Be Automated and What Must Be Manual), and by implementing in this two-phase hand-off approach these possibilities are easy to miss.
  • Lower quality. If all of this manual burden happens at the end of the process, then quality will probably start being tossed out the window.
  • Project slips. Obviously another outcome of this approach is project slips.

Dangerous Phrases

Here are some dangerous phrases to listen out for:

  • "We'll automate what we can". This is a signal that the technical team isn't engaging in a discussion about the migration but will bang out whatever they can quickly to ram the content into the new system.
  • "We looked at the content and it isn't regular enough to automate". If there are only ten content items that are in question, then move on and migration manually. If you have a large batch of content, you might want to dig in further. I once had a system integrator say that just because the phrase "Description" was used on some pages and "Overview" on others that it couldn't scrape out the content components. Wrong: regular doesn't mean identical and it is trivial to deal with these types of issues once identified.
  • "We'll turn it into XHTML and then you take it from there". Run. Fast. Turning into XHTML is trivial, and you probably need transformation as well. This is a sign that the technical team is not thinking creatively about migration.
  • "Since you'll have to edit the content anyway, you might as well clean up the HTML as well". Remember that editing and QA are different.
  • Anything ending in "and you clean up after that". The process should be iterative, not resulting in the editorial teams cleaning up unnecessarily.

A Better Planned Migration Approach

Instead of just stumbling into the migration process, I would recommend a more planned approach. Some ways of doing this (also see Website Migration Handbook):

  1. Do a proof of concept and pilot
  2. Iterate on the migration (so it isn't just two steps of "automating what we can" and then leaving it to the the content owners to clean up all sorts of problems)
  3. Keep estimating the cost of the migration, including both technical and non-technical resources, to keep the dialog going about where the burden will fall
  4. Ensure that all of these types of issues are talked about when initially working with your development shop, system integrator, or internal technical team to help determine an approach to migration instead of just hoping for the best later
  5. define what acceptable quality is

Website Migration Handbook

First published 12 July 2010