Methods & Validation
Mathematical models, algorithmic assumptions, and validation datasets. Built for transparency and academic review.
IPA Extraction Method
The engine utilizes a deterministic rule-based heuristic to convert English orthography into International Phonetic Alphabet (IPA) tokens. Unlike neural approaches, this ensures consistent reproducibility.
// Example Heuristic Rule (simplified)
if (char == 'C') {
if (nextChar is 'E', 'I', 'Y') return /s/ // Fricative
else if (nextChar is 'H') return /tʃ/ // Affricate
else return /k/ // Plosive
}Limitation: This model assumes standard American English pronunciation. Divergent pronunciations (e.g., "Siobhan" /ʃəˈvɔːn/) are approximated based on grapheme frequency.
Sonority Hierarchy Model
Rhythm scores are calculated using a 10-point Sonority Scale derived from Clements (1990). This measures the inherent "loudness" or "openness" of a speech sound.
| Class | Examples | Value |
|---|---|---|
| Low Vowels | /a/, /æ/ ("Cat") | 10 |
| Mid Vowels | /e/, /o/ ("Met") | 9 |
| High Vowels | /i/, /u/ ("Bee") | 8 |
| Glides | /w/, /y/ ("Yes") | 7 |
| Liquids | /l/, /r/ ("Run") | 6 |
| Nasals | /m/, /n/ ("No") | 5 |
| Voiced Fricatives | /z/, /v/ ("Zoo") | 4 |
| Voiceless Fricatives | /s/, /f/ ("Sit") | 3 |
| Voiced Plosives | /b/, /d/, /g/ ("Bed") | 2 |
| Voiceless Plosives | /p/, /t/, /k/ ("Top") | 1 |
Information Entropy
We calculate the "Distinctiveness" of a name using Shannon Entropy (bits). This measures information content relative to standard English frequency.
Shannon Entropy Formula
H(X) = -Σ p(x) log₂(p(x))
Probability of character x in the English corpus (OEC)
High Entropy = Rare combinations (e.g., "Xryz")
Low Entropy = Common patterns (e.g., "Anna")
Keyboard Ergonomics
The "Ease of Typing" score is derived from a graph-traversal model of the QWERTY layout. We calculate Euclidean distance between sequential keystrokes.
Alternation Bonus
Typing speed increases when hands alternate (Left → Right → Left). Names like "DORIS" receive a Flow bonus.
Inward Roll Advantage
Movement from pinky to index finger (Inward Roll) is biomechanically faster than outward movement.
Acoustic Simulation
"Mouth Feel" is visualized by mapping phonemes to their Place of Articulation on a 2D Cartesian plane:
A vector sum determines if the name is "Projective" (net movement forward) or "Ingestive" (net movement backward).
Statistical Limitations
Correlation ≠ Causation
The "First Impression" metrics are based on statistical averages from population studies (e.g., Implicit Egotism). They describe aggregate sociolinguistic tendencies, not individual destiny or character. A person named "Kate" (High Competence Score) is not guaranteed to be competent, but the sound of the name carries those cultural associations.
📜 Scientific References
All algorithms in this application are derived from these peer-reviewed publications.