dreadedmonkeygod . net

Legacy Data

Okay, so I want to move this site to XHTML. Updating the page templates is easy. In fact, I think I've already done it, as much as is possible while still validating as HTML 4.01 Transitional.

The problem is: all my posts are written in HTML 4.01 Transitional. (And some of them don't validate as anything, they're so sketchy.) So no matter what I do to fix the templates, the actual content of the pages will fubar my hopes at XHTML validation.

Yeah, if this is the worst of my problems, I'm doing pretty well in life. True, true.

But it's still annoying to get tagged by something I knew about at the time, saw comming, and handwaved. "I'll get to that in my Copious Free Time," I said. Well, now I have a few things I want to do, and they can either only be done in XHTML, or XHTML is a significant help. Either way, the time has come. And I'm not going to make the same misake twice.

Now, I'm fine writing new posts in XHTML, so long as it's machine-validated. Why? Because I want to retain the ability to insert links, images, etc. without having to learn a bogus new pseudo-language like Markdown or something. And because once I get to XHTML, I can XSL-Fu my way to whatever other format I want.

So, my mission:

  • Find a way to automagically update three years' worth of garbage HTML so that it'll validate as XHTML.
  • Validate all new posts as XHTML, and all comments as a subset of XHTML. (For example: links, bold, and paragraphs fine; images, style attributes, javascript verboten.)

Yeah, I could be out curing cancer or something. But I'm doing this.

Post a Comment

Name:
Email (Never, ever displayed.)
URL:
Remember me next time.
Comments (Sorry, no HTML allowed. Space paragraphs with a blank line.):