Converting Word to HTML with Apache POI
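I see there is a converter called WordToHtmlConverter, but it does not expose a public processing method. How should I pass in a .doc file and get back an HTML file (or an OutputStream)?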
This code works for me now!
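    import java.io.ByteArrayOutputStream;
    import java.io.FileInputStream;
    import java.nio.charset.StandardCharsets;
    import javax.xml.parsers.DocumentBuilderFactory;
    import javax.xml.transform.OutputKeys;
    import javax.xml.transform.Transformer;
    import javax.xml.transform.TransformerFactory;
    import javax.xml.transform.dom.DOMSource;
    import javax.xml.transform.stream.StreamResult;
    import org.apache.poi.hwpf.HWPFDocumentCore;
    import org.apache.poi.hwpf.converter.WordToHtmlConverter;
    import org.apache.poi.hwpf.converter.WordToHtmlUtils;
    import org.w3c.dom.Document;

    // Load the binary .doc file.
    HWPFDocumentCore wordDocument = WordToHtmlUtils.loadDoc(
            new FileInputStream("D:\\temp\\seo\\1.doc"));

    // The converter builds its HTML into an empty DOM document.
    WordToHtmlConverter wordToHtmlConverter = new WordToHtmlConverter(
            DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument());
    wordToHtmlConverter.processDocument(wordDocument);
    Document htmlDocument = wordToHtmlConverter.getDocument();

    // Serialize the DOM as HTML into an in-memory stream.
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    DOMSource domSource = new DOMSource(htmlDocument);
    StreamResult streamResult = new StreamResult(out);
    TransformerFactory tf = TransformerFactory.newInstance();
    Transformer serializer = tf.newTransformer();
    serializer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
    serializer.setOutputProperty(OutputKeys.INDENT, "yes");
    serializer.setOutputProperty(OutputKeys.METHOD, "html");
    serializer.transform(domSource, streamResult);
    out.close();

    // Decode with the same charset the serializer wrote, not the platform default.
    String result = new String(out.toByteArray(), StandardCharsets.UTF_8);
    System.out.println(result);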
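Since the question also asks about getting an OutputStream, here is a rough sketch of how the serialization half of the answer could be pulled out into a helper. It uses only the JDK's javax.xml.transform API, so it accepts any org.w3c.dom.Document, including the one returned by wordToHtmlConverter.getDocument(). The class name HtmlSerializer and the method names writeHtml and toHtmlString are made up for illustration; they are not part of POI.

```java
import java.io.ByteArrayOutputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;

// Hypothetical helper (not a POI class): serializes a DOM document as HTML.
final class HtmlSerializer {

    private HtmlSerializer() {
    }

    // Writes the document to any OutputStream (file, socket, buffer) as UTF-8 HTML.
    // The caller owns the stream and is responsible for closing it.
    static void writeHtml(Document htmlDocument, OutputStream out)
            throws TransformerException {
        Transformer serializer = TransformerFactory.newInstance().newTransformer();
        serializer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
        serializer.setOutputProperty(OutputKeys.INDENT, "yes");
        serializer.setOutputProperty(OutputKeys.METHOD, "html");
        serializer.transform(new DOMSource(htmlDocument), new StreamResult(out));
    }

    // Convenience wrapper: serializes into memory and decodes with the same charset.
    static String toHtmlString(Document htmlDocument) throws TransformerException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        writeHtml(htmlDocument, buffer);
        return new String(buffer.toByteArray(), StandardCharsets.UTF_8);
    }
}
```

To get an HTML file instead of a String, the same method can be handed a FileOutputStream opened in a try-with-resources block, for example HtmlSerializer.writeHtml(htmlDocument, fileOut), where the target path is whatever suits your setup.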