Errors, standards and corrections

From: Alan (aengel@paterra.com)
Date: Fri Sep 06 2002 - 16:40:18 BST


I am currently working through how Paterra Version 2.0 handles sequence
listings and would appreciate some advice from users. This question
relates more broadly to dealing with errors and noncompliance.

While many recent Japanese patent publications that contain sequence
listings that are in compliance with ST.25, many are partially
noncompliant or in older formats. (I have yet to find a US patent that
is in compliance.)

It is possible to write algorithms that automatically bring noncompliant
sequence listings into compliance. For example, nucleic acid sequences
that are all upper case in the original document can be converted to
lower case.

Also, to some extent, one may be able to automatically convert sequence
listings in older formats to ST.25 format as part of the machine
translation process .

What are the views of information users on the issue of automatic
recognition and correction of errors, and also on the issue of automatic
conversion of legacy formats to current standards?

Alan

-- 
----------------------------------
Paterra, Inc.
www.paterra.com

---------------------------------------------------------------------------------------------------------------------- The information contained in this email is confidential and intended only for the use of the individual or entity named above. If the reader of this message is not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. Derwent Information Limited will accept no responsibility or liability in respect to this email other than to the addressee. If you have received this communication in error, please notify us immediately via email: postmaster@derwent.co.uk ----------------------------------------------------------------------------------------------------------------------



This archive was generated by hypermail 2b30 : Fri Feb 14 2003 - 11:57:05 GMT