联邦调查局目前使用模拟机制打印和编辑文件。我很想知道是否有人研究过推断可能的单词或名字的准确性。当然,考虑到可变宽度字体集中字符的静态宽度、有限的可用空间以及字符序列的可能性,这应该是可行的。
以下是摘自第61页的文档的片段。
片段读起来,
(U//LES) An identified `<REDACTED>` as of October planned to engage in sniper attacks
against protestors in Houston, Texas, if deemed necessary. An identified `<REDACTED>` had
received intelligence that indicated the protesters in New York and Seattle planned similar protests in
Houston, Dallas, San Antonio, and Austin, Texas. `<REDACTED>` planned to gather intelligence against
the leaders of the protest groups and obtain photographs, then formulate a plan to kill the leadership via
suppressed sniper rifles. (Note: protests continued throughout the weekend with approximately 6000
persons in NYC. "Occupy Wall Street" protests have spread to about half of all states in the US, over a
dozen European and Asian cities, including protests in Cleveland 10/6-8/11 at Willard Park which was
initially attended by hundreds of protestors).
一个小片段覆盖了拉丁文字符,
"(),-./0168:<>ACDEHLNOPRSTUWYabcdefghiklmnoprstuvwxyze
我相信其他人也可以从文件里被抓走。
我拿了最后几行(没有编辑),并在他们的副本,在新罗马时代相同的文本。我确信字体是Times,但是扫描不好,或者是做了一些事情来模糊对齐。
此漏洞是根据可变宽度图形进行预测的。其中许多词可能因缺乏连贯性而被取消资格,而另一些则因其组成字符大小而被取消资格。对于可变宽度的字体,每个字形都有自己的大小特征,虽然肯定会有碰撞,但我认为它们可以做很多事情来填补空白。举个例子,这里有两个七个字母的单词--我们可以看到其中一个字母要大得多:
然而,看看先期的时代新罗马,我不知道这将有多有用。
"512" : "."
"569" : "ijlt"
"682" : "-frI"
"797" : "sJ"
"909" : "acez"
"1024" : "bdghknopquvxy0123456789"
"1139" : "FPS"
"1251" : "ELTZ"
"1366" : "BCR"
"1479" : "wADGHKNOQUVXY"
"1593" : "m"
"1821" : "M"
"1933" : "W"
我们在这里可以看到,在[A-Za-z0-9.-]
字符类中,有13个不同的可能长度。
发布于 2014-04-07 07:30:08
有太多的可能组合,任何合理大小的编辑,这是有用的。当然,您可以使用间距分析来拒绝可能的值,但这并不是生成候选值的实用方法(例如,4096的预付款可能是60个单词中的任何一个;如果允许10%的错位,则为299个单词)。
https://security.stackexchange.com/questions/54972
复制相似问题