有从PDF中提取文字的控件吗?如“Aspose.Pdf for .NET”
我们有Aspose.Pdf for .NET的控件,可以从PDF提取文字,但是对中文字体支持不是很好,建议你试用一下 PDFlib TET 这个控件,相对来说对中文支持好得多。
请推荐html->PDF, Word->PDF的好产品
根据您的需求,我们建议结合Aspose.Pdf,Aspose.Words,Aspose.PDF.Kit三个产品来使用,可以达到您所描述的需求。
收藏订阅
Aspose.Pdf for .NET 3.9.0.0
Introduction
In this release, PDF/A-1a is supported and now the PDF/A-1 is fully supported (Beta version). In addition, Widows/Orphans control is supported. Users can disallow/allow Widows/Orphans easily by turning on/off Widows/Orphans control. The macro #$UNICODE() is supported which can be used in both Heading and Text. Direct-to-stream mode is also supported. It is known that each font has a set of supported characters. Sometimes, users may assign a font to a Segment paragraph which doesn't support every character appear in the Segment. In this release, it is possible to adjust fonts automatically. It will select a proper font to the Segment paragraph according to its contents. Besides, some important features are enhanced and some bugs are fixed.
What's New?
What's improved?
What's Fixed? The main bugs fixed are listed as following:
Aspose.Pdf for Java 2.4.0.0
In this new release, a lot of important features about Section, Text, Table and Image have been improved to enable customer to use them more conveniently and effectively. HeaderFooter class has been rewritten to give customer more control as well as keep conformance with .net. Some bugs have been fixed to provide customer more practical function.
What's Improved?
What's Fixed?
The main bugs fixed are listed as following: