暂无图片
暂无图片
暂无图片
暂无图片
暂无图片

GBase 8c 函数和操作符 - 文本检索函数和操作符 之 文本检索调试函数

菁菁 2022-10-26
237

  •  ts_debug([ config regconfig, ] document text, OUT alias text, OUT description text, OUT token text, OUT dictionaries regdictionary[], OUT dictionary regdictionary, OUT lexemes text[])

描述:测试一个配置。

返回类型:setof record

示例:

 gbase=# SELECT ts_debug('english', 'The Brightest supernovaes');

                                     ts_debug                                      

-----------------------------------------------------------------------------------

 (asciiword,"Word, all ASCII",The,{english_stem},english_stem,{})

 (blank,"Space symbols"," ",{},,)

 (asciiword,"Word, all ASCII",Brightest,{english_stem},english_stem,{brightest})

 (blank,"Space symbols"," ",{},,)

 (asciiword,"Word, all ASCII",supernovaes,{english_stem},english_stem,{supernova})

(5 rows)

  •  ts_lexize(dict regdictionary, token text)

描述:测试一个数据字典。

返回类型:text[]

示例:

 gbase=# SELECT ts_lexize('english_stem', 'stars');

 ts_lexize

-----------

 {star}

(1 row)

  •  ts_parse(parser_name text, document text, OUT tokid integer, OUT token text)

描述:测试一个解析。

返回类型:setof record

示例:

 gbase=# SELECT ts_parse('default', 'foo - bar');

 ts_parse  

-----------

 (1,foo)

 (12," ")

 (12,"- ")

 (1,bar)

(4 rows)

  •  ts_parse(parser_oid oid, document text, OUT tokid integer, OUT token text)

描述:测试一个解析。

返回类型:setof record

示例:

 gbase=# SELECT ts_parse(3722, 'foo - bar');

 ts_parse  

-----------

 (1,foo)

 (12," ")

 (12,"- ")

 (1,bar)

(4 rows)

  •  ts_token_type(parser_name text, OUT tokid integer, OUT alias text, OUT description text)

描述:获取分析器定义的记号类型。

返回类型:setof record

示例:

 gbase=# SELECT ts_token_type('default');

                        ts_token_type                         

--------------------------------------------------------------

 (1,asciiword,"Word, all ASCII")

 (2,word,"Word, all letters")

 (3,numword,"Word, letters and digits")

 (4,email,"Email address")

 (5,url,URL)

 (6,host,Host)

 (7,sfloat,"Scientific notation")

 (8,version,"Version number")

 (9,hword_numpart,"Hyphenated word part, letters and digits")

 (10,hword_part,"Hyphenated word part, all letters")

 (11,hword_asciipart,"Hyphenated word part, all ASCII")

 (12,blank,"Space symbols")

 (13,tag,"XML tag")

 (14,protocol,"Protocol head")

 (15,numhword,"Hyphenated word, letters and digits")

 (16,asciihword,"Hyphenated word, all ASCII")

 (17,hword,"Hyphenated word, all letters")

 (18,url_path,"URL path")

 (19,file,"File or path name")

 (20,float,"Decimal notation")

 (21,int,"Signed integer")

 (22,uint,"Unsigned integer")

 (23,entity,"XML entity")

(23 rows)

  •  ts_token_type(parser_oid oid, OUT tokid integer, OUT alias text, OUT description text)

描述:获取分析器定义的记号类型。

返回类型:setof record

示例:

 gbase=# SELECT ts_token_type(3722);

                        ts_token_type                         

--------------------------------------------------------------

 (1,asciiword,"Word, all ASCII")

 (2,word,"Word, all letters")

 (3,numword,"Word, letters and digits")

 (4,email,"Email address")

 (5,url,URL)

 (6,host,Host)

 (7,sfloat,"Scientific notation")

 (8,version,"Version number")

 (9,hword_numpart,"Hyphenated word part, letters and digits")

 (10,hword_part,"Hyphenated word part, all letters")

 (11,hword_asciipart,"Hyphenated word part, all ASCII")

 (12,blank,"Space symbols")

 (13,tag,"XML tag")

 (14,protocol,"Protocol head")

 (15,numhword,"Hyphenated word, letters and digits")

 (16,asciihword,"Hyphenated word, all ASCII")

 (17,hword,"Hyphenated word, all letters")

 (18,url_path,"URL path")

 (19,file,"File or path name")

 (20,float,"Decimal notation")

 (21,int,"Signed integer")

 (22,uint,"Unsigned integer")

 (23,entity,"XML entity")

(23 rows)

  •  ts_stat(sqlquery text, [ weights text, ] OUT word text, OUT ndoc integer, OUT nentry integer)

描述:获取tsvector列的统计数据。

返回类型:setof record

示例:

 gbase=# SELECT ts_stat('select ''hello world''::tsvector');

   ts_stat   

-------------

 (world,1,1)

 (hello,1,1)

(2 rows)

「喜欢这篇文章,您的关注和赞赏是给作者最好的鼓励」
关注作者
【版权声明】本文为墨天轮用户原创内容,转载时必须标注文章的来源(墨天轮),文章链接,文章作者等基本信息,否则作者和墨天轮有权追究责任。如果您发现墨天轮中有涉嫌抄袭或者侵权的内容,欢迎发送邮件至:contact@modb.pro进行举报,并提供相关证据,一经查实,墨天轮将立刻删除相关内容。

评论