일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 |
8 | 9 | 10 | 11 | 12 | 13 | 14 |
15 | 16 | 17 | 18 | 19 | 20 | 21 |
22 | 23 | 24 | 25 | 26 | 27 | 28 |
29 | 30 | 31 |
Tags
- Gradle
- linux
- it
- elasticsearch
- Design Patterns
- devops
- Git
- ReactJS
- jsp
- Spring
- JVM
- Web Server
- laravel
- tool
- jenkins
- java
- redis
- Spring Batch
- IntelliJ
- 요리
- AWS
- MySQL
- springboot
- Spring Boot
- db
- 맛집
- Oracle
- javascript
- ubuntu
- php
Archives
- Today
- Total
아무거나
Java에서 Apache OpenOffice + JODConverter 를 활용한 PDF Converter 개발 본문
Java & Kotlin/Java
Java에서 Apache OpenOffice + JODConverter 를 활용한 PDF Converter 개발
전봉근 2020. 6. 17. 01:37반응형
Java에서 Apache OpenOffice + JODConverter 를 활용한 PDF Converter 개발
Apache OpenOffice란 다양한 운영체제에서 사용하 수 있는 오피스 제품이다.(오픈소스) -> 현재는 버전업 속도가 빠른 리브레오피스(LibreOffice)를 추천 현재 라이브러리는 RedHat Linux 기준으로 작성되었다.
OpenOffice v4.0.0 기준으로 작성되었다. (그 이상 버전에서는 jre 관련 에러가 표시되어 다운그레이드함)
- build.gradle에 의존성 추가
dependencies { ... // Windows compile group: 'org.jodconverter', name: 'jodconverter-core', version: '4.0.0-RELEASE' // Linux compile group: 'com.artofsolving', name: 'jodconverter', version: '2.2.0' }
- OpenOffice 설치(v4.0.0)
- Windows
// .exe 파일 설치하면 완료
- Linux
- 다운 후 압축해제
$ wget https://sourceforge.net/projects/openofficeorg.mirror/files/4.0.0/binaries/ko/Apache_OpenOffice_4.0.0_Linux_x86-64_install-rpm_ko.tar.gz/download -O Apache_OpenOffice_4.0.0_Linux_x86-64_install-rpm_ko.tar.gz $ sudo yum remove openoffice* libreoffice* $ tar -xvf Apache_OpenOffice_4.0.0* $ cd ko
- sudo vi /etc/yum.conf [yum.conf]
[yum.conf] ## Add exclude row [main] exclude=openoffice.org-ure* libreoffice-ure*
- 설치
$ sudo rpm -Uvh RPMS/*.rpm RPMS/desktop-integration/openoffice4.0-redhat-*.rpm $ soffice -headless -accept="socket,host=127.0.0.1,port=8100;urp;tcpNoDelay=1" -nofirststartwizard --convert-to &
- Java 코드 작성
import lombok.extern.slf4j.Slf4j; import org.jodconverter.OfficeDocumentConverter; import org.jodconverter.office.DefaultOfficeManagerBuilder; import org.jodconverter.office.OfficeException; import org.jodconverter.office.OfficeManager; import java.io.File; import java.io.FileInputStream; import java.io.IOException; @Slf4j public class FileToPdfConverter { private FileToPdfConverter() { throw new IllegalStateException("Utility class"); } public static boolean setFileToPdfParse(String origFilePath, String parseFilePath) throws OfficeException, IOException { boolean result = true; if (origFilePath.isEmpty() || parseFilePath.isEmpty()) { log.info(">>>>>>>>>>>>>>>>>>>>>>>>> setFileToPdfParse Parameter Empty ERROR !!"); result = false; } log.info(">>>>>>>>>>>>>>>>>>>>>>>>> encoding: " + System.getProperty("file.encoding")); log.info(">>>>>>>>>>>>>>>>>>>>>>>>> origFilePath: " + getFileEncoding(origFilePath)); File origFile = new File(origFilePath); File parseFile = new File(parseFilePath); DefaultOfficeManagerBuilder builder = new DefaultOfficeManagerBuilder(); builder.setPortNumber(8100); builder.setOfficeHome(new File("/opt/openoffice4")); builder.setTaskExecutionTimeout(600000L); // 10 minutes builder.setMaxTasksPerProcess(2); OfficeManager officeManager = builder.build(); try { log.info(">>>>>>>>>>>>>>>>>>>>>>>>> [PDF Parser Util] Start !!"); officeManager.start(); OfficeDocumentConverter converter = new OfficeDocumentConverter(officeManager); converter.convert(origFile, parseFile); log.info(">>>>>>>>>>>>>>>>>>>>>>>>> [PDF Parser Util] End !!"); } catch (Exception e) { if (log.isErrorEnabled()) { log.error(">>>>>>>>>>>>>>>>>>>>>>>>> setFileToPdfParse ERROR !! {}", e.getMessage()); } } finally { officeManager.stop(); origFile.delete(); } return result; } // Windows Test public static void main(String[] args) throws Exception { OfficeManager officeManager = new DefaultOfficeManagerBuilder().build(); String origPath = "./fasoo_drm/contents/cal.xls"; String newPath = "./fasoo_drm/contents/okok.pdf"; File origFile = new File(origPath); File newFile = new File(newPath); try { log.info(">>>>>>>>>>>>>>>>>>>>>>>>> encoding: " + System.getProperty("file.encoding")); log.info(">>>>>>>>>>>>>>>>>>>>>>>>> origFilePath: " + getFileEncoding(origPath)); officeManager.start(); OfficeDocumentConverter converter = new OfficeDocumentConverter(officeManager); converter.convert(origFile, newFile); officeManager.stop(); origFile.delete(); log.info(">>>>>>>>>>>>>>>>>>>>>>>>> origFile Delete !!"); } catch (Exception e) { if (log.isErrorEnabled()) { log.error(">>>>>>>>>>>>>>>>>>>>>>>>> FileToPdfParser ERROR !! {}", e.getMessage()); } } } // encoding return private static String getFileEncoding(String filePath) { String fileEncodingStr = "EUC-KR"; try { FileInputStream fis = new FileInputStream(filePath); byte[] BOM = new byte[4]; fis.read(BOM, 0, 4); if ((BOM[0] & 0xFF) == 0xEF && (BOM[1] & 0xFF) == 0xBB && (BOM[2] & 0xFF) == 0xBF) { fileEncodingStr = "UTF-8"; } else if ((BOM[0] & 0xFF) == 0xFE && (BOM[1] & 0xFF) == 0xFF) { fileEncodingStr = "UTF-16BE"; } else if ((BOM[0] & 0xFF) == 0xFF && (BOM[1] & 0xFF) == 0xFE) { fileEncodingStr = "UTF-16LE"; } else if ( (BOM[0] & 0xFF) == 0x00 && (BOM[1] & 0xFF) == 0x00 && (BOM[0] & 0xFF) == 0xFE && (BOM[1] & 0xFF) == 0xFF ) { fileEncodingStr = "UTF-32BE"; } else if ( (BOM[0] & 0xFF) == 0xFF && (BOM[1] & 0xFF) == 0xFE && (BOM[0] & 0xFF) == 0x00 && (BOM[1] & 0xFF) == 0x00 ) { fileEncodingStr = "UTF-32LE"; } } catch (Exception e) { if (log.isErrorEnabled()) { log.error("fileEncodingChk ERROR !! {}", e.getMessage()); } } return fileEncodingStr; } }
- 문제해결
- 만약 한글이 ??로 깨져서 표시될 때 Font issue일 가능성이 농후
- http://cdn.naver.com/naver/NanumFont/fontfiles/NanumFont_TTF_ALL.zip 에서 폰트 다운
- openoffice 설치 경로인 /opt/openoffice4/share/fonts/truetype 에 .ttf 폰트파일 이동
- sudo fc-cache -r 폰트적용
- Font가 적용되었음에도 한글이 ??로 깨져서 표시될 때
- soffice 프로세스가 죽어있는 경우 한글이 깨질 수 있음 -> 다시 실행
- 만약 한글이 ??로 깨져서 표시될 때 Font issue일 가능성이 농후
- 다운 후 압축해제
- Windows
반응형
'Java & Kotlin > Java' 카테고리의 다른 글
Annotation 설명 및 실습 (0) | 2021.05.05 |
---|---|
리플렉션(Reflection) 이란? (0) | 2021.05.03 |
파일복사 (0) | 2020.06.06 |
[Swagger] java.lang.NumberFormatException: For input string 오류 표시 해결 (1) | 2020.05.30 |
Stream 정렬 관련 Example (0) | 2020.05.28 |
Comments