Physics Derivation Graph navigation Sign in

Steps to convert a published scientific paper into the Physics Derivation Graph

Published 2021-08-08T15:15:00.002Z by Physics Derivation Graph

This post documents the steps taken to get the derivations in the paper Speed of sound from fundamental physical constants into the Physics Derivation Graph. 

1) Read the paper. I need to understand the point of the paper and where content is in the text. 

2) Identify the primary result of the derivation. Which equation is the output? Is there more than one? In this paper, equation 1 is the primary result.

Tangent: I tested the question of whether the most popular equation is the most important.
This is measurable using the command

cat sound1.tex | tr " " "\n" |\          # replace spaces with newlines -- word tokenization
   grep "ref{" |\                        # find internal references to labeled expressions
   sed 's/.*\\ref{//' | sed 's/}.*//' |\ # remove the irrelevant latex "\ref{}" to isolate the expression label ID
   sort | uniq -c | sort -n              # rank the popularity of expression labels
2 v001
3 abinitio
3 all
3 ratio
3 v0
3 v01
3 v1
5 elemental
6 bohr
6 rydberg
11 a
15 v00
15 v3
Decoding those latex labels to equations in the PDF, "v3" is equation 9, which is tied for most popular with "v00" -- equation 4. The second most popular label "a" is equation 10. The equation I'm claiming as the primary result, equation 1, is labeled as "v0" and is referenced 3 times.

3) What sections of the paper contain the derivation?
There are actually two derivations of the same result.

4) On paper, write out equations from the paper. Here I'm limiting the scope to one derivation on page 2 column 1. Fill in additional equations that are missing from the text. Use arrows to denote flow of derivation.

Often I'll make math mistakes or go down paths that turn out to be irrelevant. If I were a typical researcher, this is where the reconstruction process would stop.

5) On a separate paper, rewrite the graph and label the edges in the directed graph to describe the operation being carried out. 

6) On a separate paper, rewrite the content from the previous step to refine the "operation being carried out" to a directed graph with inference rules available in the Physics Derivation Graph. 

7) Enter the symbols used in this derivation in or identify pre-existing symbols and record their PDG ID. 

8) If novel inference rules are needed for this derivation, enter those in

9) Using, enter the expressions and inference rules for the steps of the derivation. For each step, 

The output of this step is

10) Offline, in the JSON file containing the derivation, edit the symbol IDs and SymPy representations as needed to get the step validations to be correct. 

Review the modifications on

TODO: The extensive linking of variables to definitions is not made accessible in the output, so there is no way for the user to leverage the linked information. 

As a result of the above observation, was opened.