Submitted by muhammed on Sun, 07/21/2013 - 05:46

Is there a program that makes PDF docs searchable?

4 replies

Sun, 07/21/2013 - 05:46

muhammed

Joined: 04/13/2013

I mean the kind of PDF where the text is a scan (an image) of a physical text document.

I found this page (below). Does anyone know whether this program will work with pdf docs?

http://packages.trisquel.info/dagda/amd64/graphics/tesseract-ocr

Sun, 07/21/2013 - 09:44

Platypus333

Joined: 12/10/2010

The gImageReader and OCRFeeder front-ends are listed as able to open PDF files. There may be others too.

http://en.wikipedia.org/wiki/Tesseract_%28software%29#User_interfaces

OCRFeeder is in the Trisquel repository as the package ocrfeeder.
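For anyone who prefers the command line to a front-end, here is a rough sketch of the same idea in Python. It assumes that pdftoppm (from poppler-utils) and tesseract are both installed, and that the installed Tesseract is recent enough to support the `pdf` output config; older releases only write plain text or hOCR. Tesseract reads images rather than PDFs, so the pages are rasterized first. The function names, the `page` file prefix and the 300 dpi default are only illustrative choices, not anything taken from the packages mentioned above.

```python
import subprocess
from pathlib import Path


def rasterize_cmd(pdf_path, prefix, dpi=300):
    """Build the pdftoppm command that renders each PDF page to a PNG."""
    # -r sets the resolution in dpi, -png selects PNG output.
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(prefix)]


def ocr_cmd(image_path, out_base, lang="eng"):
    """Build the tesseract command that writes <out_base>.pdf with a text layer."""
    # The trailing "pdf" config is an assumption about the installed version.
    return ["tesseract", str(image_path), str(out_base), "-l", lang, "pdf"]


def ocr_pdf(pdf_path, work_dir, lang="eng"):
    """Rasterize a scanned PDF and OCR every page; return the per-page PDFs."""
    work_dir = Path(work_dir)
    work_dir.mkdir(parents=True, exist_ok=True)
    prefix = work_dir / "page"
    subprocess.run(rasterize_cmd(pdf_path, prefix), check=True)

    outputs = []
    # pdftoppm names its output page-1.png, page-2.png, ... (zero-padded
    # when there are many pages), so a plain sort keeps the page order.
    for image in sorted(work_dir.glob("page-*.png")):
        out_base = image.with_suffix("")
        subprocess.run(ocr_cmd(image, out_base, lang), check=True)
        outputs.append(out_base.with_suffix(".pdf"))
    return outputs
```

Calling `ocr_pdf("scan.pdf", "ocr-out")` would leave one searchable PDF per page in the `ocr-out` directory. Joining them back into a single file is left out of this sketch.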

Sun, 07/21/2013 - 12:04

lembas

Joined: 05/13/2010

>I found this page (below). Does anyone know whether this program will work with pdf docs?

OCR will likely work with pretty much any file format.

You could also try the pdftotext command from the poppler-utils package.

Sun, 07/21/2013 - 17:25

Magic Banana

Joined: 07/24/2010

'pdftotext' does not do OCR. It only works on documents produced by a program, which contain real characters rather than images of them.
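That difference gives a quick way to tell whether a PDF needs OCR at all. The sketch below, in Python, runs pdftotext and checks whether any text comes back. It assumes pdftotext is installed; the function names and the 20-character threshold are arbitrary illustrative choices, and a scan with only a page number or a stamp in its text layer could fool the heuristic.

```python
import subprocess


def extract_text(pdf_path):
    """Return whatever text layer pdftotext finds in the PDF."""
    # The "-" argument makes pdftotext write to standard output.
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def looks_scanned(text, min_chars=20):
    """Guess that a PDF is a pure scan if it has almost no extractable text."""
    # Drop all whitespace (including the form feeds pdftotext emits
    # between pages) before counting characters.
    return len("".join(text.split())) < min_chars
```

For example, `looks_scanned(extract_text("scan.pdf"))` returning True suggests the file contains only page images and would need an OCR tool to become searchable.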

Mon, 07/22/2013 - 04:18

muhammed

Joined: 04/13/2013

Thanks guys, this really helps
