Recently I decided to figure out how I can use a large language model to read engineering and construction documents.
In this industry we pass around a ridiculous amount of pdf’s.
It is often useful to plot data as a histogram with a representative probability density function (PDF).
Over the past year I have spent a significant amount of time determining the geotechnical parameters to use for modeling and other analyses.
In 2004 Bent Flyvbjerg published a guidance document titled Procedures for Dealing with Optimism Bias in Transport Planning. He’s got a lot of good analysis and plots, but one in particular brought a question to mind:
Over the last year I have had the privilege of working on some of the largest tunneling projects in North America. Most recently, two of the first single-bore transit tunnels to be built here: Toronto’s Scarborough Subway Extension (technically, I have just been working on the stations, not the tunnel) and San Francisco’s Silicon Valley Extension.
In geotechnical engineering we commonly have to determine the load the ground exerts on a structure. Two of the main parameters used to determine that are the coefficient of lateral earth pressure (
The geotechnical data management software I use at work has report templates that perform SQL joins to relate test data to geologic strata. For example, if a test is performed at 20 feet, it looks at the strata data (borelogs) and determines what code (like sandstone
or limestone
) to assign. Unfortunately, the software only provides reports for limited data types and offer zero customization of the output. And, they do not provide a way for me to write my own SQL queries. Thankfully, I have been learning a little python (specifically the pandas
library), and found a pretty good solution.
In 1971 two psychologists wrote a paper titled “Belief in the Law of Small Numbers”. Their thesis is that individuals (both trained scientists and laypeople) consider a sample (a set of data) drawn from a population to be more representative of the population than they should, especially when the sample is small. In other words, when dealing with small data sets, we will erroneously assume that our “mean” value is very representative of the population mean.
When I am not working at work, I am usually thinking about (and tinkering with) ways to adopt things from other industries to make us better. Most of these just sit on the shelves in my mind with little to no action taken (my excuse is that they would require too much behavior change from other people, the hardest thing in the world).