Tíquete #33551

Bit vectors exclude a match they shouldn't

: 2014-03-22 01:58 Última Atualização: 2014-03-22 04:13

Relator:
Dono:
Tipo:
Estado:
Fechado
Componente:
Prioridade:
9 - Highest
Gravidade:
5 - Medium
Resolução:
Fixed
Arquivo:
Nenhum

Details

The dictionary entry "【气】⿳<𠂉>⿰丿一一乁" should be matched by the query "|⺡⿳⿰丿一一乁", but isn't matched when a bit vector index is used. It specifically seems to be excluded by the BDD test. This represents a failure of correctness of the bit vector algorithm, and is high-priority to nail down and fix because it's a blocking issue for a related academic paper. Once fixed, a new release of IDSgrep will be in order.

Ticket History (2/2 Histories)

2014-03-22 01:58 Updated by: mskala
  • New Ticket "Bit vectors exclude a match they shouldn't" created
2014-03-22 04:13 Updated by: mskala
  • Resolução Update from Nenhum to Fixed
  • Estado Update from Aberto to Fechado
  • Ticket Close date is changed to 2014-03-22 04:13
Comentário

Fixed by recent commit. It was one character wrong in an array index for calculating the bit vectors of ternary roots, causing the first child of the first child of a ternary root to be recorded as a child, not grandchild, of the root. Very rare that this would make a difference, but the case described in the report happens to depend on it.

Attachment File List

No attachments

Editar

You are not logged in. I you are not logged in, your comment will be treated as an anonymous post. » Login