View previous topic :: View next topic |
Author |
Message |
Q3Man n00b
Joined: 03 Aug 2003 Posts: 5
|
Posted: Fri Apr 14, 2006 9:59 pm Post subject: Recommendations for Document Management System |
|
|
I need some recommendations on software, which may or may not even exist
I'm working on transitioning a small business off of a win2k server onto gentoo one service at a time, and everything is going well. One area that is in desperate need of improvement is in document management. As of now, a tremendous amount of documents are generated and stored in a organically-grown share on thier server. A significant percentage of those documents are PDF's from thier network scanner whose text is of course unsearchable.
I've been playing around with both knowledgetree and owl to try and introduce some sort of order but both seem unsuited for the task. What I'm looking for (which I admit may not exist) is a document management system with defineable metadata (such as KT and Owl) but also supports (even poor) OCR of PDF's and allows you to search through them. Right now, it seems that my best bet is to start diving into Owl's code and adding some OCR support.
Are there any other document management systems that anyone has run across? I'm interested in both good and bad results. I'm looking at ~250k documents of various kinds, and certian documents need custom metadata (such as signing dates or expiration dates).
Having the documents available inside a filesystem is a big plus. Google Desktop Search was a godsend for them, and I'd like to keep that functionality. |
|
Back to top |
|
|
brot Guru
Joined: 06 Apr 2004 Posts: 322
|
Posted: Fri Apr 14, 2006 11:32 pm Post subject: |
|
|
maybe not what you are asking, but it looks like kat (when using kde) oor beagle (for the gnome in you) is worth looking for you. (should do the same what google desktop search does) |
|
Back to top |
|
|
Vogateer n00b
Joined: 27 Jul 2004 Posts: 49 Location: Oklahoma
|
Posted: Sat Apr 15, 2006 4:29 am Post subject: |
|
|
Hmmm, never used any of these yet, but I've been thinking about trying one out at work.
There's xinco:
http://www.xinco.org/
I'd like to know more about these myself. Anyone have any experiences with some DMSs? |
|
Back to top |
|
|
z0mb13 n00b
Joined: 03 May 2005 Posts: 10 Location: Cape Town, South Africa
|
Posted: Tue Aug 08, 2006 10:34 am Post subject: |
|
|
KnowledgeTree OSS 3.x indexes numerous file formats including PDF's.
ebuild at http://packages.gentoo.org/ebuilds/?knowledgetree-3.0.3
Also KnowledgeTree Tools for Windows now has ImagingTools in Beta2 which provides OCR and indexing of scanned documents. |
|
Back to top |
|
|
pactoo Guru
Joined: 18 Jul 2004 Posts: 553
|
|
Back to top |
|
|
yakapiece Tux's lil' helper
Joined: 03 Feb 2004 Posts: 126 Location: Atlanta, GA
|
Posted: Tue Aug 22, 2006 5:50 pm Post subject: |
|
|
Q3man,
I'm not sure what you have had success with yet. I just tried xinco - under it's feature lists it hints at having some sort of OCR - however it doesn't (as far as I can tell). I've been playing around with it and it seems like a very basic repository for files (with a java UI).
I'm going to try out Ktree. I've used it in the past but I'm interested in the client-side tool that does OCR.
If you want to update this thread it might help!
Thank you |
|
Back to top |
|
|
|