vendredi 2 août 2019

How to discover patterns in sequences of numbers in R

I have a list in R in which every single row is a half verse of bible text. The columns are: B(ook), C(chapter), V(erse), H(alf verse), and a1–a31. These columns with a plus an integer are codes that represent Hebrew cantillation marks.

What I need is a way to find patterns in the sequences of the numbers that tell me which combinations of integers occurs and how many times.

E.g.: how many times is 74 followed by 63; how many times is 63 preceded by 05.

Ideally it would also tell me combination of more than two. E.g.: how many times is 74 preceded by 05 which is preceded by 35.

Finally I'd need to chart this in some way.

Below are the header and the first 3 rows of the list.

B,C,V,H,a1,a2,a3,a4,a5,a6,a7,a8,a9,a10,a11,a12,a13,a14,a15,a16,a17,a18,a19,a20,a21,a22,a23,a24,a25,a26,a27,a28,a29,a30,a31

Genesis,1,1,A,73,74,92

Genesis,1,1,B,71,73,71,00

Genesis,1,2,A,81,71,3303,80,73,74,92

Aucun commentaire:

Enregistrer un commentaire