No problem Jarlve. This stuff is very time consuming and hard work. I was taking a week off to paint my entire house... grew a short beard and got one wall painted.
Where messages are found
Yes, I agree that we should take into consideration where in the message cycles are found. On the one hand, examining only part of the message will eliminate a lot of false cycles. But on the other hand, examining the message as a whole will show us what cycles continue throughout and are more likely to be Z made.
Test cipher simulating wildcard, other team(?) and distribution of diffusion
If you want, you can delegate some tasks to me. I could make a cipher on my cipher key spreadsheet that simulates wildcard for you. If you feel more comfortable doing it yourself, that's fine. I understand if you have something very specific in mind and feel more comfortable with that.
I found the Zodiac Killer Ciphers website this morning. I saw Jarlve's reference in his last post about another team working on the same concept (?) and did a quick search but didn't see anything. Can you point me in the right direction? ***
But on the website home page, doranchak was writing about making experimental ciphers to test any particular hypothesis. Now back to the discussion of Z340.
Regardless of who makes the test cipher, do you guys think that Z used higher L counts to diffuse higher frequency letters? What I mean is, in my analysis from yesterday, I found several symbols with count of 10 or 11. Some cycle with each other, and some do not, just looking at two symbols comparisons. Four of these symbols could be e, three t or a. Out of curiosity, how would any test cipher distribute the diffusion? We know that many of the symbols with count of 3, 4, 5 or 6 cycle with other symbols. But what to do with the high count symbols?
It sounds like reading Jarlve's recent posts that if Zodiac used low L cycles for high frequency letters, then the message would have been solved by now.
Are the high count symbols in low L cycles that represent high frequency letters? Or what about in higher L cycles with randomized selection of intermediate symbols (ABCD ABCD ACBD ABCD ACBD)? Or could they be wildcards that are cycles with each other? If there are too many wildcards, doesn't that minimize the confidence that we have in any solution? Probably depends on how many. You guys don't have to answer that one in highly technical detail.
Another way to look at the symbols and cycles both separately and together?
I was looking at the scatter graph and if you look at symbols with say, count of 2, 3, 4 and 5. If you look at the distribution of dots in each column, there is a pattern there as well. Do you see it? ** *** * * * (sideways).
The lower dots represent symbols that have low cycle scores, and the higher dots represent symbols that have high cycle scores. I am thinking that the higher dots may represent symbols that are in higher L cycles because they have higher total cycle scores. What do you guys think about that idea? Anyway, perhaps a different scoring formula would yield new information.
Or what about this idea. Use this or another method to score each individual symbol, and use that score to help score the cycles? Let's say Symbol X scores higher in the scattergraph, and is found in cycles ABAB and ABCD. Does that affect whether ABAB or ABCD score higher? But then take into account scores for A, C and D in the formula as well. Could that help flush out the Z made cycles?
EDIT: Symbol B not Symbol X
Get some rest, Jarlve, and thanks very much for what you have done so far. I will consider shaving and painting another wall. Let me know about the test cipher. We could simulate the analysis two or three posts above, or something similar. Whatever you want to do.
S.T.

