org.apache.lucene.util.fst.Util.toUTF16()方法的使用及代码示例

x33g5p2x  于2022-02-01 转载在 其他  
字(6.4k)|赞(0)|评价(0)|浏览(168)

本文整理了Java中org.apache.lucene.util.fst.Util.toUTF16()方法的一些代码示例,展示了Util.toUTF16()的具体用法。这些代码示例主要来源于Github/Stackoverflow/Maven等平台,是从一些精选项目中提取出来的代码,具有较强的参考意义,能在一定程度帮忙到你。Util.toUTF16()方法的具体详情如下:
包路径:org.apache.lucene.util.fst.Util
类名称:Util
方法名:toUTF16

Util.toUTF16介绍

[英]Just maps each UTF16 unit (char) to the ints in an IntsRef.
[中]只需将每个UTF16单元(字符)映射到IntsRef中的INT。

代码示例

代码示例来源:origin: org.apache.lucene/lucene-analyzers-common

  1. private FST<CharsRef> parseConversions(LineNumberReader reader, int num) throws IOException, ParseException {
  2. Map<String,String> mappings = new TreeMap<>();
  3. for (int i = 0; i < num; i++) {
  4. String line = reader.readLine();
  5. String parts[] = line.split("\\s+");
  6. if (parts.length != 3) {
  7. throw new ParseException("invalid syntax: " + line, reader.getLineNumber());
  8. }
  9. if (mappings.put(parts[1], parts[2]) != null) {
  10. throw new IllegalStateException("duplicate mapping specified for: " + parts[1]);
  11. }
  12. }
  13. Outputs<CharsRef> outputs = CharSequenceOutputs.getSingleton();
  14. Builder<CharsRef> builder = new Builder<>(FST.INPUT_TYPE.BYTE2, outputs);
  15. IntsRefBuilder scratchInts = new IntsRefBuilder();
  16. for (Map.Entry<String,String> entry : mappings.entrySet()) {
  17. Util.toUTF16(entry.getKey(), scratchInts);
  18. builder.add(scratchInts.get(), new CharsRef(entry.getValue()));
  19. }
  20. return builder.finish();
  21. }

代码示例来源:origin: org.apache.lucene/lucene-analyzers-common

  1. /** Builds the NormalizeCharMap; call this once you
  2. * are done calling {@link #add}. */
  3. public NormalizeCharMap build() {
  4. final FST<CharsRef> map;
  5. try {
  6. final Outputs<CharsRef> outputs = CharSequenceOutputs.getSingleton();
  7. final org.apache.lucene.util.fst.Builder<CharsRef> builder = new org.apache.lucene.util.fst.Builder<>(FST.INPUT_TYPE.BYTE2, outputs);
  8. final IntsRefBuilder scratch = new IntsRefBuilder();
  9. for(Map.Entry<String,String> ent : pendingPairs.entrySet()) {
  10. builder.add(Util.toUTF16(ent.getKey(), scratch),
  11. new CharsRef(ent.getValue()));
  12. }
  13. map = builder.finish();
  14. pendingPairs.clear();
  15. } catch (IOException ioe) {
  16. // Bogus FST IOExceptions!! (will never happen)
  17. throw new RuntimeException(ioe);
  18. }
  19. return new NormalizeCharMap(map);
  20. }
  21. }

代码示例来源:origin: harbby/presto-connectors

  1. private FST<CharsRef> parseConversions(LineNumberReader reader, int num) throws IOException, ParseException {
  2. Map<String,String> mappings = new TreeMap<>();
  3. for (int i = 0; i < num; i++) {
  4. String line = reader.readLine();
  5. String parts[] = line.split("\\s+");
  6. if (parts.length != 3) {
  7. throw new ParseException("invalid syntax: " + line, reader.getLineNumber());
  8. }
  9. if (mappings.put(parts[1], parts[2]) != null) {
  10. throw new IllegalStateException("duplicate mapping specified for: " + parts[1]);
  11. }
  12. }
  13. Outputs<CharsRef> outputs = CharSequenceOutputs.getSingleton();
  14. Builder<CharsRef> builder = new Builder<>(FST.INPUT_TYPE.BYTE2, outputs);
  15. IntsRefBuilder scratchInts = new IntsRefBuilder();
  16. for (Map.Entry<String,String> entry : mappings.entrySet()) {
  17. Util.toUTF16(entry.getKey(), scratchInts);
  18. builder.add(scratchInts.get(), new CharsRef(entry.getValue()));
  19. }
  20. return builder.finish();
  21. }

代码示例来源:origin: org.infinispan/infinispan-embedded-query

  1. private FST<CharsRef> parseConversions(LineNumberReader reader, int num) throws IOException, ParseException {
  2. Map<String,String> mappings = new TreeMap<>();
  3. for (int i = 0; i < num; i++) {
  4. String line = reader.readLine();
  5. String parts[] = line.split("\\s+");
  6. if (parts.length != 3) {
  7. throw new ParseException("invalid syntax: " + line, reader.getLineNumber());
  8. }
  9. if (mappings.put(parts[1], parts[2]) != null) {
  10. throw new IllegalStateException("duplicate mapping specified for: " + parts[1]);
  11. }
  12. }
  13. Outputs<CharsRef> outputs = CharSequenceOutputs.getSingleton();
  14. Builder<CharsRef> builder = new Builder<>(FST.INPUT_TYPE.BYTE2, outputs);
  15. IntsRefBuilder scratchInts = new IntsRefBuilder();
  16. for (Map.Entry<String,String> entry : mappings.entrySet()) {
  17. Util.toUTF16(entry.getKey(), scratchInts);
  18. builder.add(scratchInts.get(), new CharsRef(entry.getValue()));
  19. }
  20. return builder.finish();
  21. }

代码示例来源:origin: NationalSecurityAgency/datawave

  1. Util.toUTF16(value, irBuilder);
  2. final IntsRef ints = irBuilder.get();
  3. synchronized (this.fst) {

代码示例来源:origin: org.infinispan/infinispan-embedded-query

  1. /** Builds the NormalizeCharMap; call this once you
  2. * are done calling {@link #add}. */
  3. public NormalizeCharMap build() {
  4. final FST<CharsRef> map;
  5. try {
  6. final Outputs<CharsRef> outputs = CharSequenceOutputs.getSingleton();
  7. final org.apache.lucene.util.fst.Builder<CharsRef> builder = new org.apache.lucene.util.fst.Builder<>(FST.INPUT_TYPE.BYTE2, outputs);
  8. final IntsRefBuilder scratch = new IntsRefBuilder();
  9. for(Map.Entry<String,String> ent : pendingPairs.entrySet()) {
  10. builder.add(Util.toUTF16(ent.getKey(), scratch),
  11. new CharsRef(ent.getValue()));
  12. }
  13. map = builder.finish();
  14. pendingPairs.clear();
  15. } catch (IOException ioe) {
  16. // Bogus FST IOExceptions!! (will never happen)
  17. throw new RuntimeException(ioe);
  18. }
  19. return new NormalizeCharMap(map);
  20. }
  21. }

代码示例来源:origin: harbby/presto-connectors

  1. /** Builds the NormalizeCharMap; call this once you
  2. * are done calling {@link #add}. */
  3. public NormalizeCharMap build() {
  4. final FST<CharsRef> map;
  5. try {
  6. final Outputs<CharsRef> outputs = CharSequenceOutputs.getSingleton();
  7. final org.apache.lucene.util.fst.Builder<CharsRef> builder = new org.apache.lucene.util.fst.Builder<>(FST.INPUT_TYPE.BYTE2, outputs);
  8. final IntsRefBuilder scratch = new IntsRefBuilder();
  9. for(Map.Entry<String,String> ent : pendingPairs.entrySet()) {
  10. builder.add(Util.toUTF16(ent.getKey(), scratch),
  11. new CharsRef(ent.getValue()));
  12. }
  13. map = builder.finish();
  14. pendingPairs.clear();
  15. } catch (IOException ioe) {
  16. // Bogus FST IOExceptions!! (will never happen)
  17. throw new RuntimeException(ioe);
  18. }
  19. return new NormalizeCharMap(map);
  20. }
  21. }

代码示例来源:origin: NationalSecurityAgency/datawave

  1. public static FST<?> getFST(SortedSet<String> values) throws IOException {
  2. final IntsRefBuilder irBuilder = new IntsRefBuilder();
  3. // The builder options with defaults
  4. FST.INPUT_TYPE inputType = FST.INPUT_TYPE.BYTE1;
  5. int minSuffixCount1 = 0;
  6. int minSuffixCount2 = 0;
  7. boolean doShareSuffix = true;
  8. boolean doShareNonSingletonNodes = true;
  9. int shareMaxTailLength = Integer.MAX_VALUE;
  10. boolean allowArrayArcs = true;
  11. int bytesPageBits = 15;
  12. final Outputs<Object> outputs = NoOutputs.getSingleton();
  13. // create the FST from the values
  14. org.apache.lucene.util.fst.Builder<Object> fstBuilder = new org.apache.lucene.util.fst.Builder<>(inputType, minSuffixCount1, minSuffixCount2,
  15. doShareSuffix, doShareNonSingletonNodes, shareMaxTailLength, outputs, allowArrayArcs, bytesPageBits);
  16. for (String value : values) {
  17. Util.toUTF16(value, irBuilder);
  18. final IntsRef scratchInt = irBuilder.get();
  19. fstBuilder.add(scratchInt, outputs.getNoOutput());
  20. }
  21. return fstBuilder.finish();
  22. }

相关文章