issue130:critique
Différences
Ci-dessous, les différences entre deux révisions de la page.
Les deux révisions précédentesRévision précédenteProchaine révision | Révision précédente | ||
issue130:critique [2018/02/26 10:02] – auntiee | issue130:critique [2018/02/27 14:27] (Version actuelle) – andre_domenech | ||
---|---|---|---|
Ligne 13: | Ligne 13: | ||
/ | / | ||
- | On m'a récemment donné une clé de produit pour Able2Extract 12, un convertisseur et éditeur de PDF. Auparavant j' | + | On m'a récemment donné une clé de produit pour Able2Extract 12, un convertisseur et éditeur de PDF. Auparavant j' |
Compatibilité | Compatibilité | ||
Ligne 19: | Ligne 19: | ||
Bien que le logiciel ne propose des paquets que pour Ubuntu et Fedora, j'ai réussi à créer un PKGBUILD qui a installé et lance correctement le fichier .deb Ubuntu sous ArchLinux. | Bien que le logiciel ne propose des paquets que pour Ubuntu et Fedora, j'ai réussi à créer un PKGBUILD qui a installé et lance correctement le fichier .deb Ubuntu sous ArchLinux. | ||
- | J'ai néanmoins rencontré un problème dans Ubuntu 16.04, Ubuntu 17.10 et ArchLinux. Plus précisément, | + | J'ai néanmoins rencontré un problème dans Ubuntu 16.04, Ubuntu 17.10 et ArchLinux. Plus précisément, |
#!/bin/bash | #!/bin/bash | ||
Ligne 33: | Ligne 33: | ||
The layout of the application itself is very familiar (after having used software such as Adobe Acrobat), and it offers some helpful (non-intrusive) tips when starting it for the first time.** | The layout of the application itself is very familiar (after having used software such as Adobe Acrobat), and it offers some helpful (non-intrusive) tips when starting it for the first time.** | ||
- | J'ai choisi cette voie parce que le paquet Able2Extract, ne semble pas ajouter bin à votre variable $PATH, ce qui signifie qu'il ne peut être lancé, soit du dossier, soit du fichier .desktop. Après avoir déplacer | + | J'ai choisi cette voie parce que le paquet Able2Extract ne semble pas ajouter bin à votre variable $PATH, ce qui signifie qu'il ne peut être lancé |
Interface de l' | Interface de l' | ||
- | La disposition de l' | + | La disposition de l' |
**Features | **Features | ||
Ligne 47: | Ligne 47: | ||
Fonctionnalités | Fonctionnalités | ||
- | L' | + | L' |
- | J'ai testé les modes Word, Excel et HTML sur quelques scans de recettes de cuisine. Certains de ces fichiers avaient été créés avec la caméra d'un smartphone ; d' | + | J'ai testé les modes Word, Excel et HTML sur quelques scans de recettes de cuisine. Certains de ces fichiers avaient été créés avec la caméra d'un smartphone ; d' |
**The conversion options offered do allow you to handle things such as missing or unrecognized glyphs, or to set the file format for Word and Powerpoint conversions (on my system, it defaulted to OpenOffice). You can also do some document styling such as margins. | **The conversion options offered do allow you to handle things such as missing or unrecognized glyphs, or to set the file format for Word and Powerpoint conversions (on my system, it defaulted to OpenOffice). You can also do some document styling such as margins. | ||
Ligne 55: | Ligne 55: | ||
The creation tool selects an image file and turns it into a PDF - I did not see an option to select text documents or word documents (though you could create PDF files using a PDF printer or something like LaTeX). The editing tools include things like adding stamps, highlights, text, comments, etc. They also include things like redacting sections of files, deleting PDF pages, extracting specific pages, and adjusting text styles. The text style adjustment appears to work only on some PDFs - in my tests these options were grayed out. They probably work only on PDFs that were created from a text document, as opposed to image-based scans.** | The creation tool selects an image file and turns it into a PDF - I did not see an option to select text documents or word documents (though you could create PDF files using a PDF printer or something like LaTeX). The editing tools include things like adding stamps, highlights, text, comments, etc. They also include things like redacting sections of files, deleting PDF pages, extracting specific pages, and adjusting text styles. The text style adjustment appears to work only on some PDFs - in my tests these options were grayed out. They probably work only on PDFs that were created from a text document, as opposed to image-based scans.** | ||
- | Results | + | Les options de conversion proposées vous permettent de gérer des trucs tels que des glyphes manquants ou non reconnus, ou de régler le format de fichier pour des conversions Word et Powerpoint (sur mon système, il est revenu par défaut a OpenOffice). Vous pouvez également appliquer quelques styles à votre document, notamment des marges. |
+ | |||
+ | L' | ||
+ | |||
+ | **Results | ||
As noted in the previous section, almost every attempt I made yielded a complete copy of the PDF. In some cases (low contrast, poorly lit, etc), there were some gaps in the resulting file. These could relatively easily be corrected or filled out (especially if you have access to the original document). The worst result came from a recipe that was in 3 columns - while the OCR system managed to correctly separate the columns (I’ve experienced some that treat 3 columns as 1 line), the character recognition of the actual text was not that impressive. The font in the PDF file was very small, and quite faint, which could have added to the lack of accuracy. The resulting file would have definitely needed proofreading and correcting (though most OCR files should be checked before deeming it finished). | As noted in the previous section, almost every attempt I made yielded a complete copy of the PDF. In some cases (low contrast, poorly lit, etc), there were some gaps in the resulting file. These could relatively easily be corrected or filled out (especially if you have access to the original document). The worst result came from a recipe that was in 3 columns - while the OCR system managed to correctly separate the columns (I’ve experienced some that treat 3 columns as 1 line), the character recognition of the actual text was not that impressive. The font in the PDF file was very small, and quite faint, which could have added to the lack of accuracy. The resulting file would have definitely needed proofreading and correcting (though most OCR files should be checked before deeming it finished). | ||
- | Overall, the results I’ve experienced using Able2Extract 12 rivals any other OCR software I’ve ever used, and is much better than other Linux-based alternatives I’ve tried so far. Is it always perfect? No, but in every test I ran, it yielded a file that would have reduced the effort required to copy the file by hand by at least 50-60%. In most cases it would have required only a few small corrections. | + | Overall, the results I’ve experienced using Able2Extract 12 rivals any other OCR software I’ve ever used, and is much better than other Linux-based alternatives I’ve tried so far. Is it always perfect? No, but in every test I ran, it yielded a file that would have reduced the effort required to copy the file by hand by at least 50-60%. In most cases it would have required only a few small corrections.** |
- | Conclusion | + | Résultats |
+ | |||
+ | Comme noté dans la section précédente, | ||
+ | |||
+ | Globalement, | ||
+ | |||
+ | **Conclusion | ||
If you do a lot of PDF work (splitting documents, OCR scans, etc), and don’t have an application for Linux to do this in, I would highly recommend giving Able2Extract a shot. Even if you have an application you use, you may not be happy with the OCR results - and then I would recommend you try Able2Extract as well. | If you do a lot of PDF work (splitting documents, OCR scans, etc), and don’t have an application for Linux to do this in, I would highly recommend giving Able2Extract a shot. Even if you have an application you use, you may not be happy with the OCR results - and then I would recommend you try Able2Extract as well. | ||
- | It’s almost a perfect score - if the package worked out of the box, and if there were extra options for HTML conversions, | + | It’s almost a perfect score - if the package worked out of the box, and if there were extra options for HTML conversions, |
+ | |||
+ | Conclusion | ||
+ | |||
+ | Si vous travaillez beaucoup avec des PDF (segmenter des documents, faire de la reconnaissance de caractères, | ||
+ | |||
+ | La note que je lui attribue est presque parfaite : si le paquet fonctionnait dès l' | ||
issue130/critique.1519635773.txt.gz · Dernière modification : 2018/02/26 10:02 de auntiee