sentencepiece/sentencepiece.spec
2021-11-27 19:37:28 +08:00

77 lines
2.8 KiB
RPMSpec

Name: sentencepiece
Version: 0.1.92
Release: 6
Summary: An unsupervised text tokenizer and detokenizer
License: Apache-2.0
URL: https://github.com/google/sentencepiece
Source0: https://github.com/google/sentencepiece/archive/v%{version}.tar.gz
Patch0: Removed-codes-where-Zero-Width-Joiner-replaced-with-.patch
Patch1: fix_of_an_unattainable_condition.patch
Patch2: Added-split_digits-to-SentencePieceTrainer.patch
Patch3: sentencepiece.pc-should-be-installed-from-CMAKE_CURR.patch
Patch4: Create-options.md.patch
Patch5: Fix-FTBFS-on-armel-mips-powerpc-m68k-and-sh4.patch
Patch6: Add-missing-include-for-BYTE_ORDER.patch
Patch7: only-install-proto-headers-if-not-using-builtin-prot.patch
Patch8: Restore-the-sentence-boundary-marker-insertion-for-t.patch
BuildRequires: gcc-c++ gcc autoconf pkgconfig protobuf-compiler protobuf
BuildRequires: cmake >= 3.14.0
Requires: protobuf protobuf-compiler
%description
SentencePiece is an unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation
systems where the vocabulary size is predetermined prior to the neural model training.
%global debug_package %{nil}
%prep
%autosetup -p1 -n %{name}-%{version}
%build
mkdir -p cmake/build
cd cmake/build
cmake ../../ -DCMAKE_CXX_FLAGS="-D_FORTIFY_SOURCE=2 -O2 -Wno-unused-result -Wno-sign-compare"\
-DCMAKE_C_FLAGS="-D_FORTIFY_SOURCE=2"\
-DCMAKE_BUILD_TYPE=Release\
-DSPM_USE_BUILTIN_PROTOBUF=ON\
-DSPM_ENABLE_SHARED=OFF\
-DSPM_BUILD_TEST=ON\
-DCMAKE_INSTALL_LIBDIR=%{buildroot}%{_libdir}\
-DCMAKE_INSTALL_BINDIR=%{buildroot}%{_bindir}\
-DCMAKE_INSTALL_INCDIR=%{buildroot}%{_includedir}
make -j24 V=1
make CTEST_OUTPUT_ON_FAILURE=1 test
cd ../../
%install
cd cmake/build
make install
%files
%defattr(-,root,root)
%{_bindir}/spm_*
%{_libdir}/*.a
%{_libdir}/pkgconfig/*
%{_includedir}/sentencepiece*.h
%changelog
* Fri Nov 27 2021 xiefangqi <xiefangqi2@huawei.com> - 0.1.92.6
- Fix split_digits support to SentencepieceTrainer spec parser
- Add sentencepiece.pc install
- Add spm_train --help option
- Fix FTBFS problem on armel/mips/powerpc/m68k/sh4
- Fix endian problem on android plarform
- Fix pb protobuf header file can't find problem
- Restore the sentence boundary
* Tue Nov 16 2021 xiefangqi <xiefangqi2@huawei.com> - 0.1.92.5
- add README.md/README.en.md
* Tue Nov 2 2021 xiefangqi <xiefangqi2@huawei.com> - 0.1.92-4
- fix of an unattainable condition
* Tue Nov 2 2021 xiefangqi <xiefangqi2@huawei.com> - 0.1.92-3
- Prevent Zero Width Joiner replaced with whitespace
* Wed Sep 29 2021 xiefangqi <xiefangqi2@huawei.com> - 0.1.92-2
- add test cases
* Wed Sep 23 2021 xiefangqi <xiefangqi2@huawei.com> - 0.1.92-1
- package init