Content management

February 24, 2010 - 5:28 GMT

What format should I be publishing my content in? This is a question we come across a lot in our work. Business documents exist in a wide variety of formats from Word, Excel, PowerPoint and PDF to web content and everything in between. As publishing models continue to evolve, distributed authorship capabilities are increasingly being offered to a wider, more disparate set of business users. This decentralized approach, while pushing document management directly into the hands of business owners, also supports its rapid publication and delivery to consumers.

November 24, 2009 - 9:23 GMT

Subtitle: The Future of Taxonomy... Ad Nauseum

This year's Taxonomy Bootcamp conference was much like years prior: full of great information, knowledgeable speakers, and a ton of self-doubt/-defense/-definition. Which is ironic: professional organizers who struggle to classify themselves. There were at least 3 major sessions dealing with the taxonomist's identity and future (in a 2-day conference with a single track, that's a lot), which left me feeling a bit estranged.

The opening session by Patrick Lambe discussed the identity of the "new taxonomist" in the field, using results from a survey of members of the Taxonomy Community of Practice. His findings were unsurprising to me at least: 

November 09, 2009 - 10:11 GMT

I'm now back home in Montreal fighting jet lag (yawn) after a great week at the J.Boye 2009 conference in Aarhus, DK. It was a great experience I hope to repeat. Perhaps my favourite event was the last one - the town hall debate, reserved for the die-hard attendees who hadn't run off to the airport. In this session, we were given 6 statements to consider and vote upon, after hearing from two "debaters" who represented the pro and against. In this case, we had the "good doctor" (David Ott, attendee from World Health Organization) and the "panda" (Neil Morgan, attendee from Word Wildlife Federation).

This is a seriously fun session that not only drew lots of laughs (I'll point out the memorable quotes), but also some intense debate. Here's a recap for those who missed out, but also this can serve as a checklist at J.Boye 2010 to see if our predictions came true. (You can also watch the video)

#1. CMS is a commodity.

Pro (Ja): Sure, "CMSs are like water - get it from one company or another, it's all dirty". (Janus Boye) There's not much difference between all the systems and vendors, so it'll eventually be like choosing a brand of shampoo. Lather, rinse, repeat every 3 years.

Against (Nej): CMSs are actually quite different - there are some that are industry-specific, scenario-specific, there are no real standards governing their architecture. You need the appropriate tool for the appropriate task - choose wisely.

Result: AGAINST - CMS is NOT a commodity. It'd be interesting to count how many in the audience were vendors... Let's see how we feel about this one after the web idol contest in 2010.

October 12, 2009 - 8:18 GMT

"How many content types should you have?"

This is the question that came up in a conference call last week on SharePoint architecture. This organization had implemented their corporate portal on SharePoint 2007 and was interested in going forward with more portal sites but had some concerns about the approach to information architecture they had undertaken.

I answered what I would answer no matter what technology it was - "Only as many as you really need to implement the appropriate level of metadata, workflow and templates." Which is of course vague, as most good consultant-ese is. I followed up with some stats: when we work on web content management implementations, we typically end up with about 10-15 content types for a site of medium complexity. We always try to keep the structure simple and number of content types few for many good reasons, ranging from ease of content structure management to content publisher user experience.

The folks on the phone were quiet for a minute... You see, the previous consultant they had worked with had a bit of a different (read opposite) approach. The philosophy they described was that SharePoint content types should be created to the maximum degree of granularity (e.g. one content type per library) so as to reduce the need for content publishers to select a content type and tag metadata values. For example, if you had a site for human resources forms, you would have one library and content type for medical forms, one library and content type for dental forms, etc. Each content type would be extremely specific and require little tagging. "If you need 30,000 content types, then so be it" is the idea. (insert eye twitch.)

September 29, 2009 - 8:40 GMT

The final draft has been submitted... Mark your calendars...

The Information Management Best Practices 2009 book is going to publication this week, in hopes of being ready for launch at the J.Boye Conference in Aarhus, Denmark, Nov 2-4. I'll be there, giving a talk on SharePoint IA, but also to lend a hand with the book launch activities.

I'm proud to have a chapter in this book, with co-authors Seth EarleyCharlie Gray (CMS & Taxonomy Strategist, Motorola), on one of our most in-depth and successful projects - integrating taxonomy with CMS at Motorola. The best practice covers the steps below in great detail, offering practical advice and screenshots from the actual implementation at Motorola.

September 23, 2009 - 12:18 GMT

How often do you get to be immersed in a completely alien work environment?

As a taxonomist, I get to learn about so many different domains through my work, from mouse genetics to greeting card manufacturing. Each company has its interesting quirks and workplaces...Like the toy manufacturer, whose workers had their cubicles adorned with all sorts of inspiration and materials: multi-colored fur, googly-eye collections, pictures of themsleves as superheroes... 

But this week, I got to experience something completely different.

We just started a content strategy project with a semiconductor equipment manufacturer which aims to help their service groups (the folks who fix the machines) get the right information at the right time. This is an interesting project involving issues around technical writing and information architecture (DITA), integration across many different knowledge systems and databases, and getting information to users in a less than hospitable environment - the clean room.

April 07, 2009 - 6:51 GMT

Oasis

Early last month, OASIS announced the approval of the Unstructured Information Management Architecture Version 1.0.  This standard creates an open method for accessing unstructured information - that is, any information that is created by and for people, and is not inherently machine-readable (e.g., not data).  UIMA can potentially become very important since it provides a standard mechanism to exchange metadata for all types of unstructured content - documents, web pages, email, voice, images and video.

As we all have heard repeated in the marketing messages of every content-related software company, over 80% of the data we run our businesses on is unstructured.  In our business we help our clients tame their mountains of content by classifying it.  Often we rely on technologies like auto-classification, entity extraction, and other analytics to tag content with metadata.  Metadata helps us bring structure - and in turn semantics or meaning - to unstructured content. 

Of course, each of these systems has its own API and its own methods of expressing the metadata it produces or consumes.  This is where UIMA comes in.  In the introduction to the UIMA standard, the team at OASIS describes a typical workflow in which various analytics packages may need to interact:

November 10, 2006 - 12:52 GMT

I have been presenting at several conferences in the past couple of weeks (10 sessions in two weeks) and I am still getting the same situation over and over again. I had an attendee in a workshop on a content management maturity model say: "I am not sure where to start. It feels like this is so overwhelming. Can't we buy the tool first? That's what my boss wants me to do." I can understand why this is a first reaction to the complexity of content management. There are so many issues and factors to consider. From business problems to content architecture, existing systems that require migration and integration, user needs and scenarios, meta-data standards, taxonomy development, work-flow processes, governance, change management and so on. The first time you are going through this, it is overwhelming. But choosing a tool before understanding exactly what you need can create at least three major problems:

August 14, 2006 - 7:28 GMT

An interesting problem was posed to a mailing list I am a part of...

Imagine that you have been using a single hierarchy to structure and organize your information for years, and it has been very successful up until now...

But now it is time to move to a different content management system, and not only that - business has changed (of course), and not every way of organizing and understanding the information could possibly have been anticipated. (Or perhaps you did anticipate some, but for practical matters limited the amount of metadata you might apply to content.) So you have new ways that users want to search and navigate, but never considered these at the start. What do you do?

August 08, 2006 - 9:34 GMT

I wanted to post a preliminary note about our new Jump Start calls planned for September and October. Each series will consist of 4 - 90 minute calls each with 2 - 3 presenters who are experts and practitioners in their fields. Here is the tentative schedule:

Content Management Jumpstart: Each Monday 1:30 - 3:00 EST September 25th through October 16th - Topics will include:

  • The basics of content management
  • Building a content management business case
  • Content management frameworks & governance
  • Integrating content management with business processes
  • Tagging, metadata and taxonomies
  • Content publishing models & reuse
  • Personalization & targeted content
  • CMS selection & deployment
  • Web content management & syndicating content
  • Global content management

Search Solutions Jumpstart Each Friday 1:30 - 3:00 EST, October 13th through November 3rd - Topics will include: