Scheme
Scheme是一种函数式编程语言,是Lisp的两种主要方言之一,不同于与之并列的Common Lisp,Scheme遵循极简主义哲学,以一个小型语言核心作为标准,加上各种强力语言工具(语法糖)来扩展语言本身[19]。Scheme是第一个使用静态作用域的Lisp方言,也是第一个引入头等续体和“干净宏”的编程语言。
编程范型 | 多范型:函数式, 指令式, 元编程 |
---|---|
语言家族 | Lisp |
设计者 | 小盖伊·史提尔和杰拉德·杰伊·萨斯曼 |
发行时间 | 1975年 |
当前版本 |
|
类型系统 | 强类型,动态类型 |
作用域 | 词法 |
文件扩展名 | .scm .ss |
网站 | https://www.scheme.org/ |
主要实现产品 | |
支持R7RS:BiwaScheme[2], Chibi[3], Chicken, Cyclone[4], Foment[5], Gambit, Gauche, Gerbil[6], GNU Guile, Kawa, Larceny[7], LIPS[8], Loko[9], MIT/GNU Scheme, Mosh[10], Picrin[11], Rapid[12], Sagittarius[13], STklos, TR7[14], Ypsilon[15] 其他:Bigloo, Chez, IronScheme, GNU Mes[16], Scheme 48, SCM, TinyScheme | |
派生副语言 | |
femtolisp[17], Racket, SIOD, T | |
启发语言 | |
ALGOL, Lisp, MDL | |
影响语言 | |
Clojure, Common Lisp, Dylan, EuLisp, Haskell, Hop.js, ISLISP, JavaScript, Julia, Lua, R, Racket, Ruby, Rust, S, Scala, Swift LispKit[18] |
简介
编辑在1975年,麻省理工学院的杰拉德·杰伊·萨斯曼与盖伊·史提尔二世,开发出了Scheme语言最初版本,随后两人通过发表“λ论文集”而不断对它进行完善和推广。Scheme与λ演算关系十分密切,故将小写字母“λ”用作标志。
麻省理工学院与其他一些院校,曾采用Scheme教授计算机科学入门课程。著名的入门教材《计算机程序的构造和解释》(SICP),利用Scheme来诠释程序设计[20]。Scheme有众多实现可视为一个主要优势[21],然而不同实现之间的差异成为了它的一个劣势,Scheme掌控委员会声称,它是“世上最不可移植的编程语言”,并且是一个“编程语言家族”而非一个单一的语言[22]。
历史
编辑起源
编辑Scheme起源于1958年由约翰·麦卡锡提出的Lisp语言。麦卡锡通过Lisp证明了,经由几个简单的算子,与用作匿名函数的借鉴自阿隆佐·邱奇的λ表示法[23],就可以构建出图灵完备的系统。麦卡锡提出的S-表达式,可以将程序与数据用相同的结构存储,这被称为同像性。Scheme的语法即来自S-表达式,这一特性使得在Scheme中实现自循环解释器变得非常简单[24]。
在1973年,麻省理工学院的Carl Hewitt提出的一种叫做演员模型的计算模型[25],并用Lisp开发当时叫做Planner-73的新语言来实现它[26]。杰拉德·萨斯曼与盖伊·史提尔为了理解演员模型,决定在Maclisp工作环境中实现一个微型Lisp解释器,并接着增加创建演员和发送消息的机制[27]。两人很快发现了演员模型与λ演算之间的相似性,所谓“演员”就是彼得·兰丁提出的闭包[28],而Joel Moses在1970年已将它介入Lisp用来解决函数参数问题[29];故而实现演员的关键,是将词法作用域介入到Lisp中[30]。
在1975年,基于对λ演算表达能力的理解[31],杰拉德·萨斯曼与盖伊·史提尔开发出了新型Lisp解释器[32],并将在其上完成的微缩版的演员实现命名为“Schemer”,后因ITS操作系统文件名的字符数限制而改为Scheme[33]。尽管Hewitt指出了Scheme不能表达特定类型的原语演员[34],Scheme解释器本身采用的简约的语法和语义,很快赢得广泛接受。
在1978年,盖伊·史提尔二世和杰拉德·杰伊·萨斯曼发表了《修订的Scheme报告:一种LISP方言》[35],从而完善了Scheme的功能,并正式将其确立为Lisp的一种主要方言。在1998年,二人在总结Scheme历史时指出,简单而强大的λ演算,使得Scheme最终得以实现极度的精简化[36]。
λ论文集
编辑“λ论文集”是杰拉德·萨斯曼与盖伊·史提尔撰写的关于Scheme的一系列论文,最早作为麻省理工学院的内部备忘录发表[37]。通常认定λ论文集包括:
- 1975年: Scheme: An Interpreter for Extended Lambda Calculus.[38]
- 1976年: Lambda: The Ultimate Imperative.[39]
- 1976年: Lambda: The Ultimate Declarative.[40]
- 1977年: Debunking the 'Expensive Procedure Call' Myth, or, Procedure Call Implementations Considered Harmful, or, Lambda: The Ultimate GOTO.[41]
- 1978年: The Art of the Interpreter or, the Modularity Complex (Parts Zero, One, and Two).[42]
- 1978年: RABBIT: A Compiler for SCHEME.[43]
- 1979年: Design of LISP-based Processors, or SCHEME: A Dialect of LISP, or Finite Memories Considered Harmful, or LAMBDA: The Ultimate Opcode.[44]
- 1980年: Compiler Optimization Based on Viewing LAMBDA as RENAME + GOTO. AI: An MIT Perspective.[45]
- 1980年: Design of a Lisp-based Processor. CACM. 23. 11.[46]
标准化
编辑Scheme的业界标准,是由专门的掌控委员会发表的《第n次修订的算法语言Scheme报告》(Revisedn Report on the Algorithmic Language Scheme),这种标题形式参照了ALGOL 60标准文档的标题[47]。Scheme曾经由IEEE标准化为IEEE 1178–1990,它于2019年11月07日停用[48]。
1998年通过的R5RS是现在被普遍接受的标准[49]。2007年通过的R6RS[50],做出了很大的变动[51],Scheme社区中有用户指责它在堆积华而不实的功能[52][53]。Scheme掌控委员会决定在R7RS中,将Scheme分化为两个独立而兼容的语言[54]:一个是2013年通过的R7RS-small[55],它为教学场合提供对IEEE/R5RS标准的及时更新,R7RS-small目前已经有了一些实现支持[56];而另一个是作为标准库合集的R7RS-Large[57],它包含了各种提议被接纳并进入冻结状态的Scheme实现要求(SRFI),以此为实际编程场合提供对R6RS标准的继续完善,现已发表了有关数据结构部分的R7RS-Large过渡草案红色版[58]。
命名法
编辑在正式场合比如Scheme标准的命名法中,提及一个λ表达式或原始过程时,偏好使用词语“过程”而非“函数”。在一般使用中,词语“过程”和“函数”是可互换使用的。过程应用有时被正式称呼为“组合”(combination)。
同其他Lisp一样,在Scheme中使用术语“thunk”,来提及没有实际参数的过程。术语“适当”(proper)尾递归,称谓所有Scheme实现的一个性质,它们都进行尾递归优化,从而支持无限数目的活跃尾递归。
基础特征
编辑Scheme大体上是一个函数编程语言,并支持其他编程范型。它同其他Lisp编程语言家族语言共享了很多特征。它的非常简单的语法基于Lisp的S-表达式:即圆括号包围的列表,其中的前缀是算符,而随后那些的是实际参数。故而Scheme程序由嵌套的列表的序列构成。列表也是Scheme中的主要数据结构,这导致了在源代码和数据格式之间的紧密等价性,即同像性。Scheme程序可以轻易的动态建立和求值Scheme代码片段。
Scheme与其他Lisp方言的列表,都是基于最基础的数据结构有序对(pair)。Scheme从它的Lisp先辈继承了一组丰富的列表处理原始运算,比如cons
、car
和cdr
。Scheme的变量都使用动态强类型系统,Scheme中的过程都是头等对象,即头等函数,因此过程可以作为值赋值给变量,或作为实际参数传递给过程。
本章主要集中于语言的创新特征,它们使得Scheme区别于其他Lisp方言。除非专门指出,这里描述的特征都有关于R5RS标准。在本章例子中使用“===> 结果
”表示法,来指示求值在紧前行上的表达式的结果。这同于R5RS中使用的约定。本章节描述的特征使得Scheme不同于在它之前的其他编程语言。Scheme的这些方面强烈的影响了Scheme语言的所有产品,并共通于始自1975年最初的λ论文,特别是自从R2RS以来[59],所有版本的Scheme编程语言。
极简主义
编辑Scheme的简约性,使它成为具备同级别功能的编程语言中最易于实现的语言,这得益于使用λ演算来从更原始的形式导出多数的语言语法[60]。例如在R5RS标准中,定义了23个基于S-表达式的语法构造,其中14个被归类为派生形式或库形式,它们可以被写为涉及原则上为lambdad的更基础形式的宏。正如R5RS(§3.1)所说:“最基础的变量绑定构造是lambda表达式,因为所有其他的变量绑定构造,都可以依据lambda表达式来做出解释”[49]。
- 基础形式:
define
、lambda
、quote
、if
、define-syntax
、let-syntax
、letrec-syntax
、syntax-rules
、set!
- 派生形式:
do
、let
、let*
、letrec
、cond
、case
、and
、or
、begin
、命名let
、delay
、unquote
、unquote-splicing
、quasiquote
下面是一个宏的例子,它将let
实现为使用lambda
来进行变量绑定的一个表达式:
(define-syntax let
(syntax-rules ()
((let ((var expr) ...) body ...)
((lambda (var ...) body ...) expr ...))))
这里的(let ((var expr) ...) body ...)
称为“模式”,模式中起始的let
被称为“关键字”,syntax-rules ()
的空括号中可添入作为辅助关键字的叫做“文字”的标识符,var
、expr
和body
称为“模式变量”,...
是称为“省略号”的标识符,它表示所跟随的子模式可以匹配零或多个输入中的元素;而((lambda (var ...) body ...) expr ...)
称为“模板”。使用上述定义的let
,一个Scheme实现可以将(let ((a 1)(b 2)) (+ b a))
,重写为((lambda (a b) (+ b a)) 1 2)
,这将实现任务缩减为编码过程实例化的工作。
词法作用域
编辑不像更早期的Lisp比如Maclisp[61],Scheme像多数现代语言一样,采用了词法作用域[62]:在一个程序单元中所有可能的变量绑定,都可以通过阅读这个程序单元来分析出来,而不需要考虑到它可能在其中被调用的那些上下文。这对比于动态作用域,它是早期Lisp方言的特征,因为在当时的编译器和解释器中,用来实现词法作用域算法的原始的文字替换方法,关联着处理代价。在动态作用域的Lisp中,对一个过程内的自由变量的引用,依赖于这个调用的上下文,完全有可能引用到这个过程外部的相当不同的绑定。
λ演算
编辑邱奇的λ演算的数学表示法,启发了Lisp使用lambda
作为介入一个过程的关键字,并影响了Lisp中涉及到使用高阶函数的函数式编程技术的发展。但是早期的Lisp由于对自由变量的处理方式[63],而不适合表达λ演算[31]。
一个正式的λ系统,拥有一些公理和一个完备的推理规则。这有助于使用数理逻辑和工具来进行分析。在这个系统中,演算可以被看作直接的演绎。λ演算的语法,来自用x, y, z, ...
、圆括号、空格、点号和符号λ形成的递归表达式[64]。λ演算的功能包括:首先,充当强力的数理逻辑的起点。其次,它可以缩减编程者在考虑实现细节上的要求,因为它可以用于模拟机器求值。最后,λ演算建立了一个坚实的元理论[65]。
在Scheme中,lambda
关键字被用于定义匿名过程,并且使用define
基础形式来定义命名过程,即将一个lambda
过程绑定到指名的全局变量[66]。在Scheme中,(define (foo x) (+ x 1))
与(define foo (lambda (x) (+ x 1)))
,在语法上是等同的。例如有序对可以表示为[67]:
(define (cons x y)
(lambda (m) (m x y)))
(define (car z)
(z (lambda (p q) p)))
(define (cdr z)
(z (lambda (p q) q)))
这样定义出来的cons
、car
和cdr
,满足恒等式(car (cons x y))
等于x
,和(cdr (cons x y))
等于y
。甚至递归也可以表示为利用λ演算的Y组合子[68]。
词法作用域的介入[62],通过在某些形式的λ表示法,和在工作编程语言中它们的实际表达之间做出等价,解决了早期Lisp的有关问题。Sussman和Steele展示了,通过将λ表达式用作“控制结构和环境修改器”,而非简单的过程实例化,可以用这个新语言优雅的导出其他编程语言包括ALGOL和Fortran的,所有指令式和声明式语义,和其他Lisp的动态作用域[69]。他们在第一篇λ论文中,与对Scheme的首次描述一起,介入了续体传递风格[70],并在后续的论文中,他们推进演示了在这种实际使用中体现出的λ演算的原生能力。
过程应用中的求值次序
编辑Scheme采用了传值调用,但不同于多数Lisp规定了过程实际参数的求值次序,Scheme不规定求值次序。对比于其他Lisp,Scheme表达式在算符位置(第一个项目)上可以是一个表达式,只要在算符位置上的表达式的结果是一个过程,这种表示就是完全合法的[71]。
在Scheme中,在算符和实际参数位置上的这些表达式的求值次序,可以在逐个调用的基础上由实现来选择,而唯一的约束是:“运算符和运算数表达式的任何并发求值的效果,被约束为一致于某种顺序次序的求值。”(R5RS sec. 4.1.3)[49]
(let
((ev (lambda (n)
(display "Evaluating ")
(display (if (procedure? n) "procedure" n))
(newline)
n)))
((ev +) (ev 1) (ev 2)))
ev
是一个过程,它描述传递给它的实际参数,接着返回这个实际参数的值。在调用过程+
来加1
和2
之中,表达式(ev +)
、(ev 1)
和(ev 2)
可以按任何次序求值,只要效果上它们不像是并行求值的。因此在上述例子被执行的时候,标准Scheme可以按任何次序来显示前三行:
Evaluating procedure
Evaluating 1
Evaluating 2
3
然而一行的文本不可以同另一行的文本产生交错,因为这会违反顺序求值约束。
块结构
编辑Scheme的代码块结构,不同于早先Lisp的语言构造[72]。自从R2RS开始[59],Scheme中的块,是通过三个“绑定构造”来实现的:let
、let*
和letrec
。例如,下列构造建立一个块,其中叫做var
的符号被绑定到数10
:
(define var "goose")
;; 这里任何到var的引用都会被绑定到"goose"
(let ((var 10))
;; 语句走到这里时,任何到var的引用都会绑定到10
)
;; 这里任何到var的引用都会绑定到"goose"
依据编程者的需要,块可以嵌套来建立任意复杂的块结构。使用块结构来建立局部绑定,减轻了不然可能发生的命名空间冲突。
let
的变体之一let*
,允许绑定引用在前面相同构造中定义的变量,例如:
(let*
((var1 10)
(var2 (+ var1 12)))
;; 但是var1的定义不可以引用var2
)
let
的另一个变体letrec
,被设计用来确使互递归过程可绑定彼此。
;; 将侯世达雌雄序列计算为有序对的列表
(define (hofstadter-male-female n)
(letrec
((female (lambda (n)
(if (= n 0)
1
(- n (male (female (- n 1)))))))
(male (lambda (n)
(if (= n 0)
0
(- n (female (male (- n 1))))))))
(let loop ((i 0))
(if (> i n)
'()
(cons (cons (female i) (male i))
(loop (+ i 1)))))))
(hofstadter-male-female 8)
===> ((1 . 0) (1 . 0) (2 . 1) (2 . 2) (3 . 2) (3 . 3) (4 . 4) (5 . 4) (5 . 5))
在一个单一的letrec
中的所有过程,可以通过名字引用其他的过程,还有在相同的letrec
中此前定义的变量的值,但是它们不可以引用在相同的letrec
中此后定义的变量的值。
let
的变体“命名let
”形式,在let
关键字之后有一个标识符。它将这些let
变量绑定到一个过程的实际参数,这个过程的名字由这个标识符给出,它的过程体是let
形式的主体。这个过程体可以通过调用这个过程达成想要的重复。命名let
被广泛用于实现迭代。
一个简单的计数器例子:
(let loop ((n 1))
(if (> n 10)
'()
(cons n
(loop (+ n 1)))))
===> (1 2 3 4 5 6 7 8 9 10)
就像Scheme中的任何过程一样,在命名let
中建立的这个过程是头等对象。
适当尾递归
编辑Scheme拥有一个迭代构造do
[73],但是在Scheme中更推崇的惯用法,是使用尾递归来表达迭代[74]。遵循标准的Scheme实现被要求优化尾递归,从而支持无限数量的活跃尾递归(R5RS sec. 3.5[49]),这个性质被Scheme报告描述为“适当”(proper)尾递归,它使得Scheme编程者可以安全的使用递归结构来书写迭代算法,这在很多时候是更符合直觉的。尾递归过程和命名let
形式,提供对使用尾递归的迭代的支持。
;; 建造从0到9的平方的列表:
;; 注意:loop简单的就是用作标签的一个任意符号。任何符号都行。
(define (list-of-squares n)
(let loop ((i n) (res '()))
(if (< i 0)
res
(loop (- i 1) (cons (* i i) res)))))
(list-of-squares 9)
===> (0 1 4 9 16 25 36 49 64 81)
头等续体
编辑在Scheme中续体是头等对象[75]。自从R2RS开始[59],Scheme提供了过程call-with-current-continuation
,简写为call/cc
,它捕获当前续体,将其包装成逃脱(escape)过程,再绑定到编程者提供的一个过程的唯一形式参数上。如果逃脱过程在后来某个时候被调用,它会放弃在后来时候正生效的任何续体,转而使用在创建这个逃脱过程之时生效的那个续体。逃脱过程与原本调用call/cc
的续体,接受相同数目的实际参数;除了用call-with-values
创建的续体,所有续体恰好接受一个值(R5RS sec. 6.4)[49]。
头等续体使得编程者能够建立非局部控制结构比如迭代器、协程和回溯。续体可以被用来模拟指令式编程语言中return语句的行为。下列函数find-first
接受函数func
和列表lst
,返回lst
中的第一个元素x
,它使得(func x)
返回真。
(define (find-first func lst)
(call/cc
(lambda (return)
(for-each
(lambda (x)
(if (func x)
(return x)))
lst)
#f)))
(find-first integer? '(1/2 3/4 5.6 7 8/9 10 11))
===> 7
(find-first zero? '(1 2 3 4))
===> #f
David Madore在1999年提出的阴阳谜题[76],展示了Scheme可以将续体当作头等对象处理,绑定它们到变量,和把它们作为给过程的实际参数来传递[77]。
统一的命名空间
编辑对比于Common Lisp,在Scheme中所有的数据和过程共享一个共同的命名空间,而Common Lisp中有函数和数据分离的命名空间,使得一个函数和一个变量可以有相同的名字,并且将一个函数作为值引用时要求特殊的表示法。这有时叫做“Lisp1与Lisp2”差异,二者分别称谓Scheme的统一的命名空间,和Common Lisp的分立的命名空间[78]。
在Scheme中, 可以使用操纵和绑定数据的原始运算来绑定过程。没有等价于Common Lisp的defun
和#'
的原始运算。
;; 变量绑定到一个数:
(define f 10)
f
===> 10
;; 变化(改变绑定值)
(set! f (+ f f 6))
f
===> 26
;; 将一个过程赋值给相同的变量:
(set! f (lambda (n) (+ n 12)))
(f 6)
===> 18
;; 将一个表达式的结果赋值给相同的变量:
(set! f (f 1))
f
===> 13
;; 函数式编程:
(apply + '(1 2 3 4 5 6))
===> 21
(set! f (lambda (n) (+ n 100)))
(map f '(1 2 3))
===> (101 102 103)
实现标准
编辑本章归档多年来做出的给与Scheme特定特征的设计决定,它们不是最初设计的直接产物。
注释
编辑直到R5RS标准,在Scheme中的标准注释是分号,它使得这行余下部分对Scheme不可见。许多实现支持可替代的约定,允许注释扩展为多于一行,而R6RS标准允许其中的两种:一个完整的S-表达式,可以通过前导#;
而变成一个注释(介入于SRFI 62[79]),和通过用#|
和|#
围绕文本,产生的“多行注释”或“块注释”。
在布尔表达式中非布尔值的处理
编辑在多数Lisp方言包括Common Lisp中,布尔表达式中的值NIL
按照惯例被求值为值假。在Scheme中,自从1991年的IEEE标准[48],除了#f
的所有的值,包括在Scheme中写为'()
的NIL
的等价者,在布尔表达式中求值为值真(R5RS sec. 6.3.1)[49]。
在多数Lisp中表示布尔值真的常量是T
,而在Scheme中它是#t
。
干净宏
编辑Scheme的语法可以轻易的通过宏系统来扩展[80]。R5RS标准介入了强力的干净宏系统[81],它允许编程者使用一种简单的模式匹配子语言,向语言增加的新的语法构造(R5RS sec 4.3)[49]。在此前的R4RS标准中,干净宏系统已经作为“高级”系统,和“低级”宏系统一起被编排入标准的附录中[82],二者都被当作对Scheme的扩展而非语言的本质部分。
干净宏的实现,也叫做syntax-rules
, 被要求遵守语言的其余部分的词法作用域。这是通过针对宏展开的特殊命名和作用域规则来确保的,从而避免在其他编程语言的宏系统中可能出现的常见编程错误。R6RS规定了更加复杂的变换系统syntax-case
,它已经作为对R5RS Scheme的一个语言扩展而能够获得到有一段时间了。例如:
;; 定义一个宏来实现“if”的具有多个表达式的变体
;; 有真分支而无假分支
(define-syntax when
(syntax-rules ()
((when pred exp exps ...)
(if pred (begin exp exps ...)))))
宏和过程的调用看起来非常相似,二者都是S-表达式,但是它们被不同的对待。在编译器遇到程序中的一个S-表达式的时候,它首先查看这个符号是否被定义为在当前词法范围内的语法关键字。如果是这样,它接着尝试展开这个宏,将在这个S-表达式尾部的项目当作实际参数,而不用编译代码来求值它们,递归的重复这个处理过程直到没有余留的宏调用。如果它不是一个语法关键字,编译器编译代码来求值在这个S-表达式尾部的实际参数,并接着求值在这个S-表达式头部的符号所表示的变量,把它作为过程来调用,并把最终的尾部表达式作为实际参数传递给它。
多数Scheme实现还提供额外的宏系统。其中最流行是语法闭包、显式重命名宏和define-macro
,它是类似于Common Lisp中提供的defmacro
系统的非干净宏。
不能指定一个宏是否为干净的,是宏系统的一个缺点。可作为替代的展开模型比如作用域集合,提供一种潜在的解决方案[83]。
延迟求值
编辑自从R2RS开始[59],Scheme通过delay
形式和过程force
支持延迟求值。
(define a 10)
(define eval-aplus2 (delay (+ a 2)))
(set! a 20)
(force eval-aplus2)
===> 22
(define eval-aplus50 (delay (+ a 50)))
(let ((a 8))
(force eval-aplus50))
===> 70
(set! a 100)
(force eval-aplus2)
===> 22
delay
原始运算,产生叫做promise的一个对象,它在将来的某个时间点上被询问从而递交结果值,promise的原本定义的文字内容会被留存,并且在第一次使用force
之后它的值也会被留存。promise永远只求值一次。它们可以被用来实现高级的惰性求值构造比如流[84]。
在R5RS中,给出了delay
和force
的推荐实现,将promise实现为没有实际参数的一个过程(thunk),并使用记忆化来确保它永远只求值一次,与调用force
的次数无关(R5RS sec. 6.4)[49]。在R6RS标准中,它们不再是原始运算,转而作为R5RS兼容库的一部分提供(rnrs r5rs (6))。
SRFI 41确使表达有限和无限序列能够具有非凡的经济性。例如,这是使用在SRFI 41中的函数定义的一个斐波那契数列[84]:
;; 定义斐波那契序列
(define fibs
(stream-cons 0
(stream-cons 1
(stream-map +
fibs
(stream-cdr fibs)))))
;; 计算这个序列的第一百个数
(stream-ref fibs 99)
===> 218922995834555169026
eval及其运行环境
编辑R5RS标准之前,在其他Lisp中无处不在的eval
过程,在Scheme中没有等价者。在第一篇λ论文中,曾经将evaluate
描述为“类似于LISP函数EVAL”[85],但在具有词法作用域的Scheme中,对这个表达式在哪个环境中求值存在困惑。例如,不明确求值下列表达式的结果应当是5
还是6
[86]:
(let ((name '+))
(let ((+ *))
(evaluate (list name 2 3))))
在求值实际参数name
的时候,在外层环境中找到了它的定义;在求值结果表达式(+ 2 3)
的时候,如果在外层环境中求值,结果是两个运算数的总和;如果在内层环境中求值,这里符号+
已经被绑定到过程*
的值,结果是两个运算数的乘积。为此在1978年的最初修订报告中,将evaluate
替代为enclose
,它接受分别为代码和运行环境的两个实际参数[87]。由于各种技术和实际原因,第二次、第三次和第四次修订报告省略了任何eval
的等价者[86]。
R5RS通过规定三个返回环境的过程,并提供接受一个S-表达式和一个环境的过程eval
,它在提供的这个环境内求值这个表达式(R5RS sec. 6.5)[49],解决了这个困惑。R6RS通过提供叫做environment
的一个过程进行了扩展,编程者通过它可以精确的指定将哪个对象导入到求值环境之内。
通过现代Scheme(通常兼容于R5RS)来求值表达式expr
,你需要定义如下这样的一个函数evaluate
:
(define (evaluate expr)
(eval expr (interaction-environment)))
interaction-environment
是解释器的全局环境。
数值塔
编辑Scheme规定了数值数据类型的相较完全的集合,包括复数和有理数类型,这在Scheme中叫做“数值塔”(R5RS sec. 6.2[49])。标准对它们作抽象处理,而不委以任何特定的内部表示。
数可以有精确性质量。一个精确数只能从涉及其他精确数的精确运算产生,不精确是传染的。标准规定任何两个实现对产生精确数的所有运算都必须产生等价的结果。
R5RS标准规定了过程exact->inexact
和inexact->exact
,它们可以用于改变一个数的精确性。inexact->exact
产生“在数值上最接近实际参数的精确数”。exact->inexact
产生“在数值上最接近实际参数的不精确数”。R6RS标准在主报告中省略了这些过程,而是在标准库中将它们规定为R5RS兼容过程(rnrs r5rs (6))。
在R5RS标准中,不要求Scheme实现去实现整个数值塔,而是它们必须实现“符合实现用途和Scheme语言精神二者的连贯子集”(R5RS sec. 6.2.3)[49]。新的R6RS标准要求实现整个数值塔,并且“精确整数对象和精确有理数对象实际上有无限的大小和精度,并且对于实现特定过程...所以它们在得到精确实际参数时总是返回精确结果”(R6RS sec. 3.4, sec. 11.7.1)[50]。
在支持精确有理复数的实现中的精确算术例子:
;; 三个有理实数和两个有理复数的总和
(define x (+ 1/3 1/4 -1/5 -1/3i 405/50+2/3i))
x
===> 509/60+1/3i
;; 检查精确性
(exact? x)
===> #t
在既不支持精确有理数也不支持复数却接受有理数表示法的实数的实现中相同算术的例子:
;; 四个有理实数的总和
(define xr (+ 1/3 1/4 -1/5 405/50))
;; 两个有理实数的总和
(define xi (+ -1/3 2/3))
xr
===> 8.48333333333333
xi
===> 0.333333333333333
;; 检查精确性
(exact? xr)
===> #f
(exact? xi)
===> #f
二者实现都符合R5RS标准,但是第二个实现不符合R6RS,因为它没有实现完整的数值塔。
原始数据类型不相交
编辑在Scheme中原始数据类型是不相交的。对于任何Scheme对象,下列谓词中只有一个可以为真:boolean?
、pair?
、symbol?
、number?
、char?
、>string?
、vector?
、port?
和procedure?
(R5RS sec 3.2)[49]。
作为对比,在数值数据类型之内,对数值值是可以有交叠的。例如,一个整数值可以同时满足如下谓词:integer?
、rational?
、real?
、complex?
和number?
(R5RS sec 6.2)[49]。
等价谓词
编辑Scheme在任意对象之间有三种不同类型的等价,它们通过三种不同的“等价谓词”来指示,即测试等式的关系算符eq?
、eqv?
和equal?
:
eq?
求值为#f
,除非它的实际参数表示在内存中相同的对象;eqv?
大体上同于eq?
,但是特殊处理原始对象(比如字符和数),使得表示相同值的数eqv?
为真,即使它们不引用相同的对象;equal?
比较数据结构比如列表、向量和字符串,来确定它们是否有全等的结构并且其内容eqv?
为真(R5RS sec. 6.1)[49]。
在Scheme中还存在依赖于类型的等价运算:string=?
和string-ci=?
比较两个字符串(后者进行不依赖大小写比较);char=?
和char-ci=?
比较字符;=
比较数[49]。
输入/输出
编辑Scheme的输入和输出基于了“端口”数据类型(R5RS sec 6.6)[49]。R5RS定义了两个缺省端口,可通过过程current-input-port
和current-output-port
来访问,它们对应于Unix概念的标准输入和标准输出。多数实现还提供current-error-port
。标准通过标准过程比如with-input-from-file
和with-output-to-file
,支持输入和标准输出的重定向。多数实现提供有重定向能力的字符串端口,使用在SRFI 6中描述的过程[88],确使很多常规的输入-输出运算能在字符串缓冲区上进行而非在文件上。R6RS标准规定了更多复杂和有能力的端口过程和很多新的端口类型。
下面的例子是使用严格的R5RS Scheme书写的。
例子1,缺省输出到current-output-port
:
(let ((hello0 (lambda() (display "Hello world") (newline))))
(hello0))
例子2,类似例子1但对输出过程使用可选的端口参数的例子:
(let ((hello1 (lambda (p) (display "Hello world" p) (newline p))))
(hello1 (current-output-port)))
类似例子1,但是输出被重定向到一个新建文件:
;; NB: with-output-to-file is an optional procedure in R5RS
(let ((hello0 (lambda () (display "Hello world") (newline))))
(with-output-to-file "helloworldoutputfile" hello0))
类似例子2,但是具有显式的文件打开和端口关闭来发送输出到文件:
(let ((hello1 (lambda (p) (display "Hello world" p) (newline p)))
(output-port (open-output-file "helloworldoutputfile")))
(hello1 output-port)
(close-output-port output-port))
类似例子2,但是使用call-with-output-file
来发送输出到一个文件:
(let ((hello1 (lambda (p) (display "Hello world" p) (newline p))))
(call-with-output-file "helloworldoutputfile" hello1))
对输入提供了类似的过程。R5RS Scheme提供了谓词input-port?
和output-port?
。对于字符输入和输出提供了write-char
、read-char
、peek-char
和char-ready?
。为了书写和阅读Scheme表达式,Scheme提供了read
和write
。在读运算时,如果输入端口到达了文件的结束处,则返回的结果是end-of-file
对象,并且这可以使用谓词eof-object?
来测试。
除了标准之外,SRFI 28定义了一个基本的格式化过程,类似Common Lisp的format
并以此命名[89]。
标准过程的重定义
编辑在Scheme中,过程被绑定到变量[66]。在R5RS中语言标准正式的授权编程者可以变更内置过程的变量绑定,在效果上重定义它们(R5RS "Language changes")[49]。例如,通过重定义+
可以将它扩展为同接受数一样接受字符串:
(set! +
(let ((original+ +))
(lambda args
(apply
(if (or (null? args) (string? (car args)))
string-append
original+)
args))))
(+ 1 2 3)
===> 6
(+ "1" "2" "3")
===> "123"
在R6RS中所有绑定,包括标准过程,都属于某个库,并且所有导出的绑定都是不可变的(R6RS sec 7.1)[50]。因此,禁止通过变更进行标准过程的重定义。转而,可以在标准过程的名字下导入不同的过程,这在效果上类似于重定义。
标准形式和过程概述
编辑Scheme语言正式定义于1998年的R5RS和2007年R6RS标准之中。它们描述了标准“形式”,即关键字和随同的语法,它们提供语言的控制结构,和执行通常任务的标准过程。
在标准Scheme中,从一个数据类型转换到另一个数据类型的过程,在它们的名字中包含字符串->
,谓词结束于一个?
,而改变已经分配了数据的值的过程结束于一个!
。Scheme编程者通常跟从这些命名约定。
标准形式
编辑本表格描述Scheme中的标准形式。某些形式出现在不止一行之中,因为它们不能被轻易的归类入语言中的一个单一功能。
在表格中标记了“L”的形式被归类为标准中的派生的“库”形式,并且在实践中通常被实现为使用了更基础形式的宏,这使得实现任务比在其他语言中要容易许多。
用途 | 形式 |
---|---|
定义 | define |
绑定构造 | lambda, do (L), let (L), let* (L), letrec (L) |
条件求值 | if, cond (L), case (L), and (L), or (L) |
顺序求值 | begin (*) |
迭代 | lambda, do (L), 命名let (L) |
语法扩展 | define-syntax, let-syntax, letrec-syntax, syntax-rules (R5RS), syntax-case (R6RS) |
引述 | quote('), unquote(,), quasiquote(`), unquote-splicing(,@) |
赋值 | set! |
延迟求值 | delay (L) |
注意begin
在R5RS中被定义为库语法,但是展开者需要知道它来完成分片功能。在R6RS中它不再是库语法。
标准过程
编辑下面两个表格描述在R5RS Scheme中的标准过程。R6RS更加具有可扩展性而如此总结是不实际的。
某些过程出现在不止一行之中,因为它们不能轻易的分类入语言的一个单一功能。
用途 | 过程 |
---|---|
构造 | vector, make-vector, make-string, list |
等价谓词 | eq?, eqv?, equal?, string=?, string-ci=?, char=?, char-ci=? |
类型转换 | vector->list, list->vector, number->string, string->number, symbol->string, string->symbol, char->integer, integer->char, string->list, list->string |
数值 | 参见独立表格 |
字符串 | string?, make-string, string, string-length, string-ref, string-set!, string=?, string-ci=?, string<? string-ci<?, string<=? string-ci<=?, string>? string-ci>?, string>=? string-ci>=?, substring, string-append, string->list, list->string, string-copy, string-fill! |
字符 | char?, char=?, char-ci=?, char<? char-ci<?, char<=? char-ci<=?, char>? char-ci>?, char>=? char-ci>=?, char-alphabetic?, char-numeric?, char-whitespace?, char-upper-case?, char-lower-case?, char->integer, integer->char, char-upcase, char-downcase |
向量 | make-vector, vector, vector?, vector-length, vector-ref, vector-set!, vector->list, list->vector, vector-fill! |
符号 | symbol->string, string->symbol, symbol? |
有序对和列表 | pair?, cons, car, cdr, set-car!, set-cdr!, null?, list?, list, length, append, reverse, list-tail, list-ref, memq. memv. member, assq, assv, assoc, list->vector, vector->list, list->string, string->list |
识别谓词 | boolean?, pair?, symbol?, number?, char?, string?, vector?, port?, procedure? |
续体 | call-with-current-continuation (call/cc), values, call-with-values, dynamic-wind |
环境 | eval, scheme-report-environment, null-environment, interaction-environment (可选) |
输入/输出 | display, newline, read, write, read-char, write-char, peek-char, char-ready?, eof-object? open-input-file, open-output-file, close-input-port, close-output-port, input-port?, output-port?, current-input-port, current-output-port, call-with-input-file, call-with-output-file, with-input-from-file(可选), with-output-to-file(可选) |
系统接口 | load (可选), transcript-on (可选), transcript-off (可选) |
延迟求值 | force |
函数式编程 | procedure?, apply, map, for-each |
布尔逻辑 | boolean? not |
在其名字中包含-ci
的字符串和字符过程,在它们的实际参数之间进行不依赖大小写的比较:相同字符的大写和小写版本被认为是相等的。
用途 | 过程 |
---|---|
基本算术算符 | +, -, *, /, abs, quotient, remainder, modulo, gcd, lcm, expt, sqrt |
有理数 | numerator, denominator, rational?, rationalize |
近似 | floor, ceiling, truncate, round |
精确性 | inexact->exact, exact->inexact, exact?, inexact? |
不等式 | <, <= , >, >=, = |
杂项谓词 | zero?, negative?, positive? odd? even? |
极大和极小 | max, min |
三角 | sin, cos, tan, asin, acos, atan |
指数 | exp, log |
复数 | make-rectangular, make-polar, real-part, imag-part, magnitude, angle, complex? |
输入-输出 | number->string, string->number |
类型谓词 | integer?, rational?, real?, complex?, number? |
接受多于二个实际参数的-
和/
在R5RS中被定义了但留作为可选的。
Scheme实现要求
编辑由于Scheme的极简主义,很多常见过程和语法形式不是由标准定义的。 为了保持核心语言很小并促进扩展的标准化,Scheme社区拥有“Scheme实现要求”(SRFI)流程,通过对扩展提案的仔细研讨来定义扩展库。这提升了代码可移植性。很多SRFI被几乎所有Scheme实现支持。
在不同的实现中具有相当广泛支持的SRFI包括[90]:
- 0: 基于特征的条件展开构造
- 1: 列表库
- 4: 同质数值向量数据类型
- 6: 基本字符串端口
- 8: 接收、绑定到多个值
- 9: 定义记录类型
- 13: 字符串库
- 14: 字符集库
- 16: 可变元数过程的语法
- 17: 广义
set!
- 18: 多线程支持
- 19: 时间数据类型和过程
- 25: 多维数组原语
- 26: 不用柯里化的特殊化形式参数的表示法
- 27: 随机数位的来源
- 28: 基本格式化字符串
- 29: 本地化
- 30: 嵌套的多行注释
- 31: 递归求值的特殊形式
- 37:
args-fold
:程序实际参数处理器 - 39: 形式参数对象
- 41: 流
- 42: 及早推导式
- 43: 向量库
- 45: 表达迭代式惰性算法的原语
- 60: 作为比特的整数
- 61: 更一般性的
cond
子句 - 66: 八位组向量
- 67: 比较过程
实现
编辑Scheme的精简设计,使得编程语言设计人士与爱好者,特别钟爱研究它,故而它有不断涌现的新实现,而活跃开发的实现也在持续跟进语言标准更新[91]。尽管Scheme有众多实现是它的一个主要长处[21],但是由于Scheme没有库函数标准,造成了很多不可互通的实现互相竞争,试图使用Scheme编写既复杂又便于移植的程序,往往比较困难[22]。R6RS和R7RS-Large,在核心语言之外制定了库函数标准,使得编译器开发者和贡献者可以实现Scheme的可移植库。
几乎所有Scheme实现都有传承自Lisp的“读取﹣求值﹣输出循环”交互模式,一些Scheme实现亦可作为编译器。很多用C语言及派生语言写成的软件,都利用Scheme作为脚本语言,很多嵌入式系统语言即是基于Scheme。Chez Scheme和Larceny等实现,可以将Scheme程序编译为本机二进制码。Gambit和Chicken等实现,可将Scheme程序编译为C代码,Bigloo还能编译成JVM字节码或者.Net字节码。
一些实现支持额外的特征。比如Kawa提供与Java类的集成,而Scheme到C编译器通常易于使用C写成的外部库,甚至允许将实际C代码嵌入到Scheme源代码中。另一个例子是用java书写的Pvts,它提供了支持Scheme学习的一组可视工具。
教科书
编辑实际用处
编辑很多著名的计算机科学院校都利用Scheme来教授入门级课程。以下为一些最为著名的教授Scheme的学校:
- 麻省理工学院是Scheme与SICP的诞生地。直到2008年为止,麻省理工学院的入门课程6.001即是用Scheme来教授的[93]。尽管现在Scheme已经不再被用于入门课程,麻省理工学院到目前为止还在教授SICP[94]。
- 伯克利加州大学的入门课程61A到2010年为止利用Scheme与SICP教授入门课程,并利用Scheme来实现Logo,另一个基于Lisp的编程语言[95]。自2011年起,61A改用Python来教授SICP[96]。
- 西北大学的入门课程CS2500利用Scheme来教授另一本著名的教材《程序设计方法》。
- 印第安那大学的入门课程C211利用Scheme来教授。
- 耶鲁大学
- 莱斯大学
- 香港科技大学
- 北京大学
- ProgramByDesign项目在美国超过600所高中教授Scheme语言。
- 滑铁卢大学数学系(包括计算机科学系)的入门课程CS115、CS116、CS135利用Scheme来教授。
- 云林科技大学
自由软件影像处理程序GIMP利用Scheme为嵌入式脚本语言[97]。GNOME中有到核心库的一个GNU的扩展语言Guile包装器[98]。在2012年出现的Julia所采用的语法解析器,是用Scheme方言Femtolisp实现的。
参见
编辑注释
编辑- ^ https://small.r7rs.org/.
- ^ BiwaScheme is a Scheme interpreter written in JavaScript.
- ^ Alex Shinn. Chibi-Scheme is a very small library intended for use as an extension and scripting language in C programs. [2022-11-13]. (原始内容存档于2022-12-23).
- ^ Cyclone Scheme is a brand-new compiler that allows real-world application development using the R7RS Scheme Language standard. [2022-11-13]. (原始内容存档于2022-09-27).
- ^ Foment is an implementation of R7RS Scheme. [2022-11-13]. (原始内容存档于2022-11-13).
- ^ Gerbil is an opinionated, some might even say tendentious, dialect of Scheme designed for systems programming.
- ^ Larceny is a simple and efficient implementation of the Scheme programming language.
- ^ LIPS is poweful Scheme based lisp interpreter written in JavaScript. [2021-11-09]. (原始内容存档于2022-04-23).
- ^ Loko Scheme, an optimizing Scheme compiler. [2022-11-13]. (原始内容存档于2022-12-06).
- ^ Mosh is a free and fast interpreter for Scheme as specified in the R7RS & R6RS. [2022-11-13]. (原始内容存档于2022-12-06).
- ^ Picrin is a lightweight R7RS scheme implementation written in pure C89. [2022-11-13]. (原始内容存档于2022-12-06).
- ^ Rapid Scheme expands and evaluates a Scheme program as described by the R7RS. [2022-11-13]. (原始内容存档于2022-11-28).
- ^ Sagittarius Scheme - R6RS/R7RS Scheme system. [2022-11-13]. (原始内容存档于2022-12-24).
- ^ TR7: tiny R7RS-small scheme interpreter. [2023-12-17]. (原始内容存档于2023-12-17).
- ^ Ypsilon: R7RS/R6RS Scheme Implementation.
- ^ GNU Mes is a Scheme interpreter and C compiler for bootstrapping the GNU System.
- ^ femtolisp - a lightweight, robust, scheme-like lisp implementation. [2022-11-14]. (原始内容存档于2022-12-22).
- ^ Swift LispKit is a framework for building Lisp-based extension and scripting languages for macOS and iOS applications. [2022-11-13]. (原始内容存档于2022-12-09).
- ^ The Scheme Programming Language. MIT. [2022-05-04]. (原始内容存档于2022-04-12).
- ^ Harold Abelson and Gerald Jay Sussman with Julie Sussman. Structure and Interpretation of Computer Programs. Cambridge, MA: MIT Press. 1996 [2011]. ISBN 0-262-01153-0. (原始内容存档于2018-03-09) (英语).
- ^ 21.0 21.1 scheme-faq-standards - What Scheme implementations are there?. Community Scheme Wiki. [2021-11-09]. (原始内容存档于2010-02-18).
- ^ 22.0 22.1 Will Clinger, Marc Feeley, Chris Hanson, Jonathan Rees and Olin Shivers. Scheme Steering Committee Position Statement. Scheme Steering Committee. 2009-08-20 [2011-12-19]. (原始内容存档于2009-08-26).
Scheme has the unhappy distinction of being the world's most unportable programming language. It is almost misleading to call Scheme a "programming language;" it would be more accurate to characterise Scheme as a family of dialects, all loosely related by the common features of lexical scope, dynamic typing, list structure, higher-order functions, proper tail-recursion, garbage collection, macros, and (some form of) s-expression based lexical syntax.
- ^ John McCarthy. History of Lisp (PDF). Artificial Intelligence Laboratory, Stanford University. 12 February 1979 [2022-11-17]. (原始内容存档 (PDF)于2020-11-07).
To use functions as arguments, one needs a notation for functions, and it seemed natural to use the λ-notation of Church (1941). I didn’t understand the rest of his book, so I wasn’t tempted to try to implement his more general mechanism for defining functions. Church used higher order functionals instead of using conditional expressions. Conditional expressions are much more readily implemented on computers.
David Turner. Some History of Functional Programming Languages (PDF). [2021-11-10]. (原始内容 (PDF)存档于2020-04-15).LISP was not based on the lambda calculus, despite using the word “LAMBDA” to denote functions. At the time he invented LISP, McCarthy was aware of (Church 1941) but had not studied it. The theoretical model behind LISP was Kleene’s theory of first order recursive functions. (McCarthy made these statements, or very similar ones, in a contribution from the floor at the 1982 ACM symposium on LISP and functional programming in Pittsburgh. No written version of this exists, as far as know.)
- ^ A meta-circular interpreter of a subset of Scheme. [2022-11-14]. (原始内容存档于2022-11-14).
- ^ Gerald J. Sussman, Guy L. Steele Jr. The First Report on Scheme Revisited. 1998 [2021-11-07]. (原始内容存档于2022-04-06).
At about this time, Carl Hewitt was developing his theory of actors as a model of computation. This model was object-oriented and strongly influenced by Smalltalk. Every object was a computationally active entity capable of receiving and reacting to messages. The objects were called actors, and the messages themselves were also actors. Every computational entity was an actor and message-passing was the only means of interaction. An actor could have arbitrarily many acquaintances; that is, it could “know about” other actors and send them messages or send acquaintances as (parts of) messages. Hewitt then went on to model many traditional control structures as patterns of message-passing among actors. Functional interactions were modeled with the use of continuations; one might send the actor named “factorial” a message containing two other actors: the number 5 and another actor to which to send the eventually computed value (presumably 120). While Conniver made control points into first-class data, the actors model went to the logical extreme by making everything be first-class data.
- ^
Carl Hewitt, Peter Bishop, Irene Greif, Brian Smith, Todd Matson, Richard Steiger. Actor induction and meta-evaluation. 1973 [2022-11-15]. (原始内容存档于2022-11-15). “For some years we have been constructing a series of languages to express our evolving understanding of the above semantic issues. The latest of these
is called PLANNER-73. ……
We shall assume that the reader is familiar with advanced pattern matching languages such as SNOBOL4, CONVERT, PLANNER-71, QA4, and POPLER.
We shall use(%A M%)
to indicate sending the message M to the actor A. We shall use[s1 s2 … sn]
to denote the finite squence s1, s2, … sn. ……
Identifiers can be created by the prefix operator=
. ……
An expression(=> pattern body)
is an abbreviation for(receive(message pattern) body)
where receive is a more general actor that is capable of bindinq elements of the action in addition to the message.
Evaluating
(%(=> pattern body) the-message%)
, i.e. sending
(=> pattern body) the-message
,
will attempt to match the-message against pattern, If the-message is not of the form specified by pattern, then the actor isNOT-APPLICABLE
to the-message. If the-message matches pattern, the body is evaluated.
……
Conceptually at least a new actor is created every time a message is sent. Consider sending to a target T a message M and a continuation C.
(send T (message M [#continuation C]))
The transmission
(%T M%)
is an abbreviation for the above where C is defaulted to be the caller. If target T is the following:(receive (message the-body [#continuation =the-continuation]) the-body)
then the the-body is evaluated in an environment where the-message is bound to M and the-continuation is bound to C.
……
Many actors who are executing in parallel can share the same continuation. They can all send a message[“return”]
to the same continuation. ”
Irene Greif, Carl Hewitt. Actor semantics of PLANNER-73. 1975 [2022-11-15]. (原始内容存档于2022-11-15). - ^
Gerald J. Sussman, Guy L. Steele Jr. The First Report on Scheme Revisited. 1998 [2021-11-07]. (原始内容存档于2022-04-06).
Hewitt and his students developed and implemented in Lisp a new language to make concrete the actor model of computation. This language was first called Planner-73 but the name was later changed to PLASMA (PLAnner-like System Modeled on Actors).
It was at this point that we (Sussman and Steele) put our heads together. (Steele had just become a graduate student at MIT, but he had been following this progression of language designs because he had had a part-time job at MIT Project MAC as one of the maintainers of MacLisp.) We wanted to better understand Hewitt’s actors model but were having trouble relating the actors model and its unusual terminology to familiar programming notions. We decided to construct a toy implementation of an actor language so that we could play with it. Using MacLisp as a working environment, we wrote a tiny Lisp interpreter and then added mechanisms for creating actors and sending messages. We wanted to better understand Hewitt’s actors model but were having trouble relating the actors model and its unusual terminology to familiar programming notions. We decided to construct a toy implementation of an actor language so that we could play with it. Using MacLisp as a working environment, we wrote a tiny Lisp interpreter and then added mechanisms for creating actors and sending messages. - ^ 彼得·兰丁. The mechanical evaluation of expressions (PDF). 1964 [2022-11-17]. (原始内容存档 (PDF)于2022-11-16).
Also we represent the value of a λ-expression by a bundle of information called a "closure," comprising the λ-expression and the environment relative to which it was evaluated. We must therefore arrange that such a bundle is correctly interpreted whenever it has to be applied to some argument. More precisely:
a closure has
an environment part which is a list whose two items are:
⑴ an environment
⑵ an identifier or list of identifiers,
and a control part which consists of a list whose sole item is an AE.
The value relative toE
of a λ-expressionX
is represented by the closure denoted by
constructclosure((E, bvX), unitlist(bodyX))
- ^
Joel Moses. The Function of FUNCTION in LISP, or Why the FUNARG Problem Should Be Called the Environment Problem (pdf). June 1970 [2009-10-27]. AI Memo 199. (原始内容存档于2010-05-23).
The use of free variables is a very powerful computational idea. In many situations it is possible to avoid using free variables by supplying a function with additional arguments. This can be quite cumbersome. ……
When a function uses free variables or when it calls functions which use them, then the value of this function is not completely determined by its arguments, since the value might depend on the values of the free variables. Thus in order to completely specify a computation, one has to specify a function and its arguments in addition to an environment in which the values of the free variables can be determined. ……
An important point which must be realized about functional arguments (abbreviated FUNARGs) is that two different environments are involved in such cases. The first environment is the one which is in effect when then FUNARG is bound as an argument. We shall call this the binding environment. The second environment is the one that is in effect when the FUNARG is activated as a function. We shall call this the activation environment. ……
Consider now what it would require for the system to restore the binding environment for functional arguments. It would require knowing where in the stack or "alist" the binding environment exists through some pointer to it. Supplying such pointer is a functionFUNCTION
in LISP. That is when one transmits a functional argumentf
which is to be evaluated in its binding environment, then one usesFUNCTION(f)
.FUNCTION
will prevent its argumentf
from being evaluated, just asQUOTE
would. The result ofFUNCTION
will be a structure which not only contains a reference to the functionf
, but also contains a pointer to the binding environment. Thus at the FUNARG's activation time we will be able to use the current place in the stack up to the point where the binding environment exists, thus skipping over any values in the activation environment which differ from those in the binding environment. ……
The points we have made so far are:
1) Free variables in function definitions require that one must have an environment in order to be able to evaluate a function.
2) When one uses functional arguments, the the question arises as to which environment one chooses in order to evaluate the function, the binding environment or the activation environment.
3) When one returns functions (which require the values of free variables for their evaluation) then one must, in general, save the binding environment and thus create a stack which is, in fact, a tree. ……
LISP allows the user to choose between the binding environment and the activation environment by allowing a function to be bound withFUNCTION
or withQUOTE
.QUOTE
will force the activation environment to be used in evaluating the function. A useful metaphor for the difference betweenFUNCTION
andQUOTE
in LISP is to think ofQUOTE
as a porous or an open covering of the function since free variables escape to the current environment.FUNCTION
acts as a closed or nonporous covering (hence the term "closure" used by Landin). Thus we talk of "open" Lambda expressions (functions in LISP are usually Lambda expressions) and "closed" Lambda expressions. ……
My interest in the environment problem began while Landin, who had a deep understanding of the problem, visited MIT during 1966-67. I then realized the correspondence between the FUNARG lists which are the results of the evaluation of "closed" Lambda expressions in LISP and ISWIM's Lambda Closures. - ^
Gerald J. Sussman, Guy L. Steele Jr. The First Report on Scheme Revisited. 1998 [2021-11-07]. (原始内容存档于2022-04-06).
Sussman had just been studying Algol. He suggested starting with a lexically scoped dialect of Lisp, because that seemed necessary to model the way names could refer to acquaintances in PLASMA. Lexical scoping would allow actors and functions to be created by almost identical mechanisms. Evaluating a form beginning with the word
lambda
would capture the current variable-lookup environment and create a closure; evaluating a form beginning with the wordalpha
would also capture the current environment but create an actor. Message passing could be expressed syntactically in the same way as function invocation. The difference between an actor and a function would be detected in the part of the interpreter traditionally known asapply
. A function would return a value, but an actor would never return; instead, it would typically invoke a continuation, another actor that it knew about. Our interpreter also provided the necessary primitives for implementing the internal behavior of primitive actors, such as an addition operator that could accept two numbers and a continuation actor. - ^ 31.0 31.1 Gerald J. Sussman, Guy L. Steele Jr. The First Report on Scheme Revisited. 1998 [2021-11-07]. (原始内容存档于2022-04-06).
This led us to three important ideas:
• First, we realized that all the patterns of control structure that Hewitt had described in terms of actors could equally well be described by the λ-calculus. ……
• Second, we realized that the λ-calculus — a small, simple formalism — could serve as the core of a powerful and expressive programming language. (Lisp had adopted the λ-notation for functions but had failed to support the appropriate behavior for free variables. The original theoretical core of Lisp was recursion equations, not the λ-calculus.) ……
• Third, we realized that in our quest for the “ultimate AI language” we had come full circle. As the MIT school had struggled to find more and more powerful ways to express and manipulate control structure to support heuristic search, we had progressed from Lisp to CONVERT to Planner to Conniver to PLASMA to a simple variation of Lisp! - ^ Gerald J. Sussman, Guy L. Steele Jr.. Scheme: An Interpreter for Extended Lambda Calculus. 维基文库. 1975 (英文).
Inspired by ACTORS [Greif and Hewitt][Smith and Hewitt], we have implemented an interpreter for a LISP-like language, SCHEME, based on the lambda calculus [Church], but extended for side effects, multiprocessing, and process synchronization. The purpose of this implementation is tutorial. We wish to:
⑴ alleviate the confusion caused by Micro-PLANNER, CONNIVER, etc. by clarifying the embedding of non-recursive control structures in a recursive host language like LISP.
⑵ explain how to use these control structures, independent of such issues as pattern matching and data base manipulation.
⑶ have a simple concrete experimental domain for certain issues of programming semantics and style.
LISP code for the SCHEME interpreter by Gerald Jay Sussman and Guy Lewis Steele, Jr. [2022-11-14]. (原始内容存档于2022-11-14). - ^ Gerald J. Sussman, Guy L. Steele Jr. The First Report on Scheme Revisited. 1998 [2021-11-07]. (原始内容存档于2022-04-06).
We were very pleased with this toy actor implementation and named it “Schemer” because we thought it might become another AI language in the tradition of Planner and Conniver. However, the ITS operating system had a 6-character limitation on file names and so the name was truncated to simply SCHEME and that name stuck.
- ^ Gerald J. Sussman, Guy L. Steele Jr. The First Report on Scheme Revisited. 1998 [2021-11-07]. (原始内容存档于2022-04-06).
We concluded that actors and closures were effectively the same concept. (Hewitt later agreed with this, but noted that two types of primitive actors in his theory, namely cells (which have modifiable state) and synchronizers (which enforce exclusive access), cannot be expressed as closures in a lexically scoped pure Lisp without adding equivalent primitive extensions.)
Carl Hewitt. ActorScript: Industrial strength integration of local and nonlocal concurrency for Client-cloud Computing. 2009 [2022-12-23]. (原始内容存档于2022-12-23).In summary, Sussman and Steele [1975] mistakenly concluded “we discovered that the 'Actors' and the lambda expressions were identical in implementation.” The actual situation is that the λ-calculus is capable of expressing some kinds of sequential and parallel control structures but, in general, not the concurrency expressed in the Actor model. On the other hand, the Actor model is capable of expressing everything in the λ-calculus and more [Hewitt 2008f].
- ^ Guy L. Steele Jr., Gerald J. Sussman. The Revised Report on SCHEME: A Dialect of LISP (PDF). 1978 [2022-11-18]. (原始内容存档 (PDF)于2022-12-12).
SCHEME is a dialect of LISP. It is an expression-oriented, applicative order, interpreter-based language which allows one to manipulate programs as data. It differs from most current dialects of LISP in that it closes all lambda-expressions in the environment of their definition or declaration, rather than in the execution environment. This has the consequence that variables are normally lexically scoped, as in ALGOL. However, in contrast with ALGOL, SCHEME treats procedures as a first-class data type. They can be the values of variables, the returned values of procedures, and components of data structures. Another difference from LISP is that SCHEME is implemented in such a way that tail-recursions execute without net growth of the interpreter stack. The effect of this is that a procedure call behaves like a
GOTO
and thus procedure calls can be used to implement iterations, as in PLASMA. - ^
Gerald J. Sussman, Guy L. Steele Jr. The First Report on Scheme Revisited. 1998 [2021-11-07]. (原始内容存档于2022-04-06).
We were actually trying to build something complicated and discovered, serendipitously, that we had accidentally designed something that met all our goals but was much simpler than we had intended.
We thought that Scheme would turn out to be the next in a long series of programming languages that had been designed at MIT over the course of 17 years. ……
In this process we learned some great lessons. The λ-calculus can be seen as a universal glue by which compound constructs can be built inductively from simpler ones and a set of primitives. …… The essence of the matter is that the λ-calculus provides a way of abstracting entities of any particular type and then combining such abstractions with other entities to make new entities of that same type, which are instances of the abstractions. …… Given the right primitives, the λ-calculus can serve as a foundation for the abstraction and instantiation of virtually any kind of complex conceptual structure.
We also were struck by the great importance of names as a means of reference. …… Naming seems to correspond to some important cognitive mechanism that is, if not innate, then at least extremely well-developed in our culture. The λ-calculus embodies a provably coherent theory of naming. - ^ The Original 'Lambda Papers' by Guy Steele and Gerald Sussman. [2021-11-07]. (原始内容存档于2022-01-12).
- ^ Gerald J. Sussman, Guy L. Steele Jr.. Scheme: An Interpreter for Extended Lambda Calculus. 维基文库. 1975 (英文).
- ^ Gerald J. Sussman, Guy L. Steele Jr. Lambda: The Ultimate Imperative. 1976 [2021-11-11]. (原始内容存档于2022-04-10).
- ^ Guy L. Steele Jr. LAMBDA: The Ultimate Declarative. 1976 [2021-11-10]. (原始内容存档于2022-04-09).
- ^ Guy L. Steele Jr. Debunking the 'Expensive Procedure Call' Myth, or, Procedure Call Implementations Considered Harmful, or, Lambda: The Ultimate GOTO. 1977 [2021-11-07]. (原始内容存档于2022-05-09).
- ^ Gerald J. Sussman, Guy L. Steele Jr. The Art of the Interpreter of the Modularity Complex (Parts Zero, One, and Two). 1978 [2021-11-07]. (原始内容存档于2021-11-07).
- ^ Guy L. Steele Jr. RABBIT: A Compiler for SCHEME. 1978 [2021-11-07]. (原始内容存档于2021-11-08).
Rabbit Scheme: Old Scheme implementation in InterLisp. [2022-11-15]. (原始内容存档于2021-12-03). - ^ Gerald J. Sussman, Guy L. Steele Jr. Design of LISP-based Processors, or SCHEME: A Dielectric LISP, or Finite Memories Considered Harmful, or LAMBDA: The Ultimate Opcode. 1979 [2021-11-07]. (原始内容存档于2021-11-07).
- ^ Guy L. Steele Jr. Compiler Optimization Based on Viewing LAMBDA as RENAME + GOTO. Patrick Henry Winston, Richard H. Brown (编). Artificial Intelligence: An MIT Perspective, Volume 2: Understanding Vision/Manipulation/Computer Design/Symbol Manipulation. 1979: 399–432 [2022-12-07]. (原始内容存档于2022-12-07).
- ^ Gerald J. Sussman, Guy L. Steele Jr. Design of a LISP-based microprocessor. 1980 [2021-11-07]. (原始内容存档于2021-11-08).
- ^ J.W. Backus; F.L. Bauer; J.Green; C. Katz; J. McCarthy P. Naur; et al. Revised Report on the Algorithmic Language Algol 60. Numerische Mathematik, Communications of the ACM, and Journal of the British Computer Society. January–April 1960 [2012-08-09]. (原始内容存档于2007-06-25).
- ^ 48.0 48.1 IEEE 1178-1990 - IEEE Standard for the Scheme Programming Language. [2022-01-14]. (原始内容存档于2021-03-04).
- ^ 49.00 49.01 49.02 49.03 49.04 49.05 49.06 49.07 49.08 49.09 49.10 49.11 49.12 49.13 49.14 49.15 49.16 Richard Kelsey, William Clinger, Jonathan Rees; et al. Revised5 Report on the Algorithmic Language Scheme. Higher-Order and Symbolic Computation. August 1998, 11 (1): 7–105 [2007-01-04]. doi:10.1023/A:1010051815785. (原始内容存档于2007-01-05) (英语).
Scheme is a statically scoped and properly tail-recursive dialect of the Lisp programming language invented by Guy Lewis Steele Jr. and Gerald Jay Sussman. It was designed to have an exceptionally clear and simple semantics and few different ways to form expressions. A wide variety of programming paradigms, including imperative, functional, and message passing styles, find convenient expression in Scheme.
- ^ 50.0 50.1 50.2 Michael Sperber, R. Kent Dybvig, Matthew Flatt, Anton Van Straaten; et al. Revised6 Report on the Algorithmic Language Scheme. Scheme Steering Committee. August 2007 [2009-10-20]. (原始内容存档于2013-06-25).
This report is accompanied by a report describing standard libraries; references to this document are identified by designations such as “library section” or “library chapter”. It is also accompanied by a report containing non-normative appendices. A fourth report gives some historical background and rationales for many aspects of the language and its libraries.
- ^ Revised6 Report on the Algorithmic Language Scheme, Appendix E: language changes. Scheme Steering Committee. 2007-09-26 [2011-12-19]. (原始内容存档于2010-04-09).
- ^ R6RS Electorate. Scheme Steering Committee. 2007 [2009-10-20]. (原始内容存档于2008-12-04).
- ^ Marc Feeley (compilation). Implementors' intentions concerning R6RS. Scheme Steering Committee, r6rs-discuss mailing list. 2007-10-26 [2009-10-20]. (原始内容存档于2008-08-20).
- ^ R7RS Home Page. [2022-11-13]. (原始内容存档于2022-12-17).
- ^ Alex Shinn, John Cowan, Arthur A. Gleckler; et al. The Revised7 Report on the Algorithmic Language Scheme (PDF). 2022-09-06 [2022-11-13]. (原始内容存档 (PDF)于2022-11-13).
R7RS-small archive. [2021-11-10]. (原始内容存档于2023-01-01). - ^ Implementations support all or part of R7RS-small. [2022-11-13]. (原始内容存档于2022-11-13).
- ^ R7RS Home Page.
- ^ R7RS-Large Interim Red Edition. 2020-03-17 [2022-11-13]. (原始内容存档于2022-11-13).
The Revised7 Report on the Algorithmic Language Scheme (“R7RS”) intentionally describes a small language, suitable for implementing on reduced hardware, or for experiments in programming language design and implementation.
However, Scheme is used for practical programming, and there a minimal language is not a help. Additional data structures and programming idioms make the programmer’s job easier (if they are well-designed!). Therefore, as part of the R7RS design process, it was agreed that a subsequent report would appear on “R7RS-Large”, a collection of Scheme libraries that are useful across a wide variety of problems.
The decision of what was to appear in R7RS-Large was left to the community, and a strategy was adopted: proposals were published as SRFIs (Scheme Requests For Implementation, http://srfi.schemers.org), and the community would then vote on which would be adopted.
As a result, R7RS-Large will be developed in several stages, depending upon the support from the community for each proposal. At each stage, several SRFIs are frozen, and implementers and users may then rely upon those libraries being in the final product. (In a few cases, a library might be revisited, with a subsequent stage adopting a completely backward-compatible new version of that library; this has already happened with generators.)
When this process has completed, the resulting interim reports will be collated and reorganized into a final document. - ^ 59.0 59.1 59.2 59.3 William Clinger. The Revised Revised Report on Scheme or An Uncommon Lisp (PDF). 1985 [2022-11-17]. (原始内容存档 (PDF)于2022-10-17).
Data and procedures and the values they amass, Higher-order functions to combine and mix and match, Objects with their local state, the message they pass, A property, a package, the control of point for a catch - In the Lambda Order they are all first-class. One thing to name them all, one things to define them, one thing to place them in environments and bind them, in the Lambda Order they are all first-class. ……
Scheme shares with Common Lisp the goal of a core language common to several implementations. Scheme differs from Common Lisp in its emphasis upon simplicity and function over compatibility with older dialects of Lisp. - ^ Gerald J. Sussman, Guy L. Steele Jr. The Revised Report on SCHEME: A Dialect of LISP. 1978.
System-Provided Extensions - Some standard syntactic extension are provided by convenience in ordinary programming. They are distinguished from other magic words in that they are semantically defined in terms of others rather then being primitive {Note FEXPRs Are Okay by Us}. For expository purposes they are described here pattern-matching/production-rules kind of language. takes advantage of the definition of list notation:
(A B C) = (A . (B . (C . NIL)))
. Thus the pattern(x . r)
matches(A B C)
, withx = A
andr = (B C)
. the ordering of the "productions" is significant; the first one which matches is to be used. - ^ Kent M. Pitman. The Revised Maclisp Manual. 1983, 2007 [2022-12-16]. (原始内容存档于2022-12-16).
In the interpreter “variables” are implemented as atomic symbols which possess “shallow-bound” value cells. The continual manipulation of value cells would decrease the efficiency of compiled code, so the compiler defines two types of variables: “special variables” and “local variables.” Special variables are identical to variables in the interpreter.
Local variables are more like the variables in commonly-used algebraic programming languages such as Algol or PL/I. A local variable no longer has an associated atomic symbol; thus it can only be referred to from the text of the same function that bound it. The compiler creates local variables forPROG
variables,DO
variables, andLAMBDA
variables, unless directed otherwise. The compiled code stores local variables in machine registers or in locations within a stack.
The principal difference between local variables and special variables is in the way a binding of a variable is compiled. (A binding has to be compiled when aPROG
,DO
, orLAMBDA
expression is compiled, and for the entry to a function which hasLAMBDA
variables to be bound to its arguments.) If the variable to be bound has been declared to be special, the binding is compiled as code to imitate the way the interpreter binds variables: the value of the atomic symbol is saved and a new value is stored into its value cell. If the variable to be bound has not been declared special, the binding is compiled as the declaration of a new local variable. Code is generated to store the value to which the variable is to be bound into the register or stack-location assigned to the new local variable. This runs considerably faster than a special binding.
Although a special variable is associated with an atomic symbol which has the name of the variable, the name of a local variable appears only in the input file - in compiled code there is no connection between local variables and atomic symbols. Because this is so, a local variable in one function may not be used as a “free variable” in another function since there is no way for the location of the variable to be communicated between the two functions.
When the usage of a variable in a program to be compiled does not conform to these rules, i.e. it is somewhere used as a “free variable,” the variable must be declared special. There are two common cases in which this occurs. One is where a “global” variable is being used, i.e. a variable which isSETQ
'ed by many functions but is never bound. The other is where two functions cooperate, one binding a variable and then calling the other one which uses that variable as a free variable. - ^ 62.0 62.1
Gerald J. Sussman, Guy L. Steele Jr.. Scheme: An Interpreter for Extended Lambda Calculus. 维基文库. 1975 (英文).
There are several important consequences of closing every lambda expression in the environment from which it is passed (i.e., in its "lexical" or "static" environment).
First, the axioms of lambda calculus are automatically preserved. Thus, referential transparency is enforced. This in turn implies that there are no "fluid" variable bindings (as there are in standard stack implementations of LISP such as MacLISP).
Second, the upward funarg problem [Moses] requires that the environment structure be potentially tree-like.
Finally, the environment at any point in a computation can never be deeper than the lexical depth of the expression being evaluated at that time; i.e., the environment contains bindings only for variables bound in lambdas lexically surrounding the expression being evaluated. This is true even if recursive functions are involved. ……Furthermore, it is not even necessary to scan the environment for the variable, since its value must be in a known position relative to the top of the environment structure; this position can be computed by a compiler at compile time on the basis of lexical scope. - ^
John McCarthy, Paul W. Abrahams, Daniel J. Edwards, Timothy P. Hart, Michael I. Levin. LISP 1.5 Programmer's Manual (PDF) 2nd. MIT Press. 1985 [1962] [2021-10-25]. ISBN 0-262-13011-4. (原始内容 (PDF)存档于2021-03-02).
The variables in a lambda expression are dummy or bound variables because systematically changing them does not alter the meaning of the expression. Thus
λ[[u;v];v2+u]
means the same thing asλ[[x;y];y2+x]
.
We shall sometimes use expressions in which a variable is not bound by a lambda. For example, in the function of two variablesλ[[x;y];xn+yn]
the variablen
is not bound. This is called a free variable. It may be regarded as a parameter. Unlessn
has been given a value before trying to compute with this function, the value of the function must be undefined. ……
When the compiler is called upon to compile a function, it looks for anEXPR
orFEXPR
on the property list of the function name. The compiler then translates this S-expression into an S-expression that represents a subroutine in the LISP Assembly Language (LAP). LAP then proceeds to assemble this program into binary program space. Thus anEXPR
, or anFEXPR
, has been changed to aSUBR
or anFSUBR
, respectively. ……
Free variables in compiled functions must be declared before the function is compiled. ……
A variable is bound in a particular function when it occurs in a list of bound variables following the wordLAMBDA
orPROG
. Any variable that is not bound is free. ……
When a variable is used free, it must have been bound by a higher level function. If a program is being run interpretively, and a free variable is used without having bee bound on a higher level, error diagnostic *A 8* will occur.
If the program is being run complied, the diagnostic may not occur, and the variable may have value NIL.
There are three types of variables in compiled functions: ordinary variables,SPECIAL
variables, andCOMMON
variables.SPECIAL
andCOMMON
variables must be declared before compiling. Any variable that is not declared will be considered an ordinary variable.
When functions are translated into subroutines, the concept of a variable is translated into a location where an argument is stored. If the variable is an ordinary one, then a storage location for it is set up on the push-down list. Other functions cannot find this private cell, making it impossible to use it as a free variable.SPECIAL
variables have the indicatorSPECIAL
on their property lists Following the indicator there is a pointer to a fixed cell. When this variable is bound, the old value is saved on the push-down list, and the current value is stored in theSPECLAL
cell. When it is no longer bound, the old value must be restored. When a function uses this variable free, then the quantity in theSPECIAL
cell is picked up.SPECIAL
variables are declared by using the pseudo-functionspecial[a]
, wherea
is a list of variable names. ……SPECIAL
variables are inexpensive and will allow free communication among compiled functions. They do not increase run time significantly.SPECIAL
variables cannot be communicated between the interpreter and compiled functions.COMMON
variables have the flagCOMMON
on their property lists; however, this is only used to inform the compiler that they areCOMMON
, and is not needed at run time.COMMON
variables are bound on an a-list by the compiled functions. When they are to be evaluated,eval
is given this a-list. This happens at run time.
The use ofCOMMON
variables will slow down any compiled function using them. However, they do provide complete communication between interpreted and compiled functions.COMMON
variables are declared bycommon[a]
, wherea
is a list of variable names. ……
LISP is designed in such a way that all functions for which the possibility of recursion can exist are in fact recursive. This requires that all temporary stored results related to the computation that is in progress be set aside when a piece of coding is to be used recursively, and that they be later restored. This is done automatically and need not be programmed explicitly.
All saving of temporary results in LISP is performed on a linear block of storage called the push-down list. Each set of stored data that is moved onto the push-down list is in a block labeled with its size and the name of the subroutine from which it came. Since it is in the nature of recursion that the first block to be saved is always the last block to be restored, it is possible to keep the push-down list compact.
Joel Moses. The Function of FUNCTION in LISP, or Why the FUNARG Problem Should Be Called the Environment Problem (pdf). June 1970 [2009-10-27]. AI Memo 199. (原始内容存档于2010-05-23).There are several ways in which to maintain the values of free variables (and, thus the computational environment) in order to handle cases like the one above. All of the techniques used involve some increase in time. One approach makes it easy to access the value of a free variable is local. This approach is favored in recent implementations of LISP. We shall call it the "shallow access" approach.
Another approach is to make it relatively easy to enter and leave a block in which a free variable is bound, but relatively expensive to access a free variable. This approach is used in the classical "alist" implementation of LISP. Several Algol systems have opted for a similar approach. We shall call this the "deep access" approach. Both of these approaches allow one to modify the value of the variable as well as access it. The access time is approximately the same as modification time in both approaches.
Let us consider the "shallow access" approach first. By "shallow access" we mean that the value of a free variable can be obtained in a single fetch from memory. Since the current value may be stored in a difficult-to-determine location up in the stack, a special cell for its current value is used. In many recent LISP implementations this special values cell is stored as a property of the atom structure, and is unique to the free variable. In order to maintain the proper value in the special cell, extra work must be done every time a function or a block is entered in which the variable is an argument or is local. On entering such a function or block one usually saves the old value which is in the special cell on the stack alone with sufficient information to allow one to know from where the stack value came. On leaving such a function or block one must remember to store the value saved in the stack in the special cell.
The "deep access" approach forces one to search upward in the stack for the most recent value of the free variable. Such a search may be quite deep and slow if the free variable was bond vary far up in the stack. However, this approach has the advantage of avoiding the need for an extra special value cell. In addition, we shall see later that this approach sometimes allows one to use free variables with greater ease and flexibility than the "shallow access" approach. ……
Certain LISP systems use "deep access" in interpreted functions and "shallow access" in compiled functions. if free variables in compiled functions are declared to beCOMMON
, their values will be stored on the "alist". In this manner one can solve the environment problem. The cost is a very slow interpreter and "deep access" to free variables. - ^ van Tonder, André. A Lambda Calculus for Quantum Computation. SIAM Journal on Computing. 1 January 2004, 33 (5): 1109–1135. S2CID 613571. arXiv:quant-ph/0307150 . doi:10.1137/S0097539703432165.
- ^ Niehren, J.; Schwinghammer, J.; Smolka, G. A concurrent lambda calculus with futures (PDF). Theoretical Computer Science. November 2006, 364 (3): 338–356 [2021-11-07]. doi:10.1016/j.tcs.2006.08.016. (原始内容 (PDF)存档于2022-04-08).
- ^ 66.0 66.1
Gerald J. Sussman, Guy L. Steele Jr.. Scheme: An Interpreter for Extended Lambda Calculus. 维基文库. 1975 (英文).
DEFINE
- This is analogous to the MacLISPDEFUN
primitive (but note that theLAMBDA
must appear explicitly!). It is used for defining a function in the "global environment" permanently, as opposed toLABELS
(see below), which is used for temporary definitions in a local environment.DEFINE
takes a name and a lambda expression; it closes the lambda expression in the global environment and stores the closure in the LISP value cell of the name (which is a LISP atom). ……LABELS
- We have decided not to use the traditionalLABEL
primitive in this interpreter because it is difficult to define several mutually recursive functions using onlyLABEL
. The solution, which Hewitt [Smith and Hewitt] also uses, is to adopt an ALGOL-esque block syntax:
(LABELS <function definition list> <expression>)
This has the effect of evaluating the expression in an environment where all the functions are defined as specified by the definitions list. Furthermore, the functions are themselves closed in that environment, and not in the outer environment; this allows the functions to call themselves and each other recursively.
Gerald J. Sussman, Guy L. Steele Jr. The Revised Report on SCHEME: A Dialect of LISP. 1978 [2021-11-07]. (原始内容存档于2022-04-06).Atoms which are not atomic symbols (identifiers) evaluate to themselves. Typical examples of such atoms are numbers, arrays, and strings (character arrays). Symbols are treated as identifiers or variables. They may be lexically bound by lambda-expressions. There is a global environment containing values initially have as their values primitive operations such as, for example,
CAR
,CONS
, andPLUS
. SCHEME differs from most LISP systems in that the atomCAR
is not itself an operation (in the sense of being an invocable object, e.g. a valid first argument toAPPLY
), but only has one as a value when considered as an identifier. - ^ Structure and Interpretation of Computer Programs - 2.1.3 What Is Meant by Data?. [2024-01-13]. (原始内容存档于2024-02-26).
- ^ y-combinator. [2024-01-06]. (原始内容存档于2024-01-06).
- ^
Gerald J. Sussman, Guy L. Steele Jr.. Lambda: The Ultimate Imperative. 维基文库. 1976 (英文).
Lambda calculus (and related languages, such as "pure LISP") is often used for modelling the applicative constructs of programming languages. However, they are generally thought of as inappropriate for modelling imperative constructs. …… we show how imperative constructs may be modelled by applicative SCHEME constructs. ……
Up to now we have thought of SCHEME’sLAMBDA
expressions as functions, and of a combination such as(G (F X Y))
as meaning “apply the functionF
to the values ofX
andY
, and return a value so that the functionG
can be applied and return a value ...” But notice that we have seldom usedLAMBDA
expressions as functions. Rather, we have used them as control structures and environment modifiers. For example, consider the expression:
(BLOCK (PRINT 3) (PRINT 4))
This is defined to be an abbreviation for:
((LAMBDA (DUMMY) (PRINT 4)) (PRINT 3))
We do not care about the value of thisBLOCK
expression; it follows that we do not care about the value of the(LAMBDA (DUMMY) ...)
. We are not usingLAMBDA
as a function at all.
It is possible to write useful programs in terms ofLAMBDA
expressions in which we never care about the value of any lambda expression. We have already demonstrated how one could represent any “FORTRAN” program in these terms: all one needs isPROG
(withGO
andSETQ
), andPRINT
to get the answers out. The ultimate generalization of this imperative programming style is continuation-passing. ……
…… we will consider the problem of dynamically scoped, or "fluid", variables. These do not exist in ALGOL, but are typical of many LISP implementations, ECL, and APL. We will see that fluid variables may be modelled in more than one way, and that one of these is closely related to continuation-passing. - ^
Gerald J. Sussman, Guy L. Steele Jr.. Scheme: An Interpreter for Extended Lambda Calculus. 维基文库. 1975 (英文).
It is always possible, ……to perform any calculation in this way: rather than reducing to its value, it reduces to an application of a continuation to its value (cf. [Fischer]). That is, in this continuation-passing programming style, a function always "returns" its result by "sending" it to another function.
- ^ Gerald J. Sussman, Guy L. Steele Jr. The Revised Report on SCHEME: A Dialect of LISP. 1978.
Non-atomic forms are divided by the evaluator into two classes: combinations and "magic (special) forms". …… Magic forms are recognized by then presence of a "magic (reserved) word" in the
car
position of the form. All non-atomic forms which are not magic forms are considered to be combinations. The system has a small initial set of magic words; there is also a mechanism for creating new ones {Note FUNCALL is a Pain}.
A combination is considered to be a list of subforms. These subforms are all evaluated. The first value mub be a procedure; it is applied to the other values to get the value of the combination. There are four important points here:
(1) the procedure Position is always evaluated just like any other position. (This is why the primitive operators are the values of global identifiers.)
(2) the procedure is never "re-evaluated"; if the first subform fails to evaluate to a applicable procedure, it is an error. Thus, unlike most LISP systems, SCHEME always evaluates the first subform of a combination exactly once.
(3) the arguments are all completely evaluated before the procedure is applied; that is, SCHEME, like most LISP systems, is an applicative-order language. Many SCHEME programs exploit this fact.
(4) the argument forms (and procedure form) may in principle be evaluated in any order. This is unlike the usual LISP left-to-right order. (All SCHEME interpreters implemented so far have in fact performed left-to-right evaluation, but we do not wish programs to depend on this fact. Indeed, there are some reasons why a clever interpreter might want to evaluate them right-to-left, e.g. to get things on a stack in the correct order.) - ^
Gerald J. Sussman, Guy L. Steele Jr.. Scheme: An Interpreter for Extended Lambda Calculus. 维基文库. 1975 (英文).
BLOCK
- This is like the MacLISPPROGN
, but arranges to evaluate its last argument without an extra net control frame (explained later), so that the last argument may involved in an iteration. Note that in SCHEME, unlike MacLISP, the body of aLAMBDA
expression is not an implicitPROGN
.
Gerald J. Sussman, Guy L. Steele Jr.. Lambda: The Ultimate Imperative. 维基文库. 1976 (英文).At this point we are almost in a position to model the most general form of compound statement. In LISP, this is called the "
PROG
feature". In addition to statement sequencing andGO TO
statements, one can return a value from aPROG
by using theRETURN
statement.
Let us first consider the simplest compound statement, which in SCHEME we callBLOCK
. Recall that(BLOCK S1 S2)
is defined to be((LAMBDA (DUMMY) S2) S1)
Notice that this not only performsS1
beforeS2
, but also returns the value ofS2
. Furthermore, we defined(BLOCK S1 S2 ... Sn)
so that it returns the value ofSn
. ThusBLOCK
may be used as a compound expression, and models a LISPPROGN
, which is aPROG
with noGO TO
statements and an implicitRETURN
of the last "statement" (really an expression).
Gerald J. Sussman, Guy L. Steele Jr. The Revised Report on SCHEME: A Dialect of LISP. 1978.(BLOCK x1 x2 ... xn)
(BLOCK x) → x
(BLOCK x . r) → ((LAMBDA (A B) (B)) x (LAMBDA () (BLOCK . r)))BLOCK
sequentially evaluates the subformsxi
from left to right. For example:(BLOCK (ASET' X 43) (PRINT X) (+ X 1))
returns44
after settingx
to43
and then printing it {NoteBLOCK
Exploits Applicative Order}.(LET ((v1 x1) (v2 x2) ... (vn xn)) . body)
→ ((LAMBDA (v1 v2 ... vn) (BLOCK . body)) x1 x2 ... xn)LET
provides a convenient syntax for binding several variables to corresponding quantities. It allows the forms for the quantities to appear textually adjacent to their corresponding variables. Notice that the variables are all bound simultaneously, not sequentially, and that the initialization formxi
may be evaluated in any order. For convenience,LET
also supplies aBLOCK
around the forms constituting its body. - ^ Gerald J. Sussman, Guy L. Steele Jr.. Scheme: An Interpreter for Extended Lambda Calculus. 维基文库. 1975 (英文).
DO
- This is like the MacLISP "new-style"DO
; old-styleDO
is not supported. - ^ Gerald J. Sussman, Guy L. Steele Jr. The Revised Report on SCHEME: A Dialect of LISP. 1978 [2021-11-07]. (原始内容存档于2022-04-06).
Another difference from LISP is that SCHEME is implemented in such a way that tail-recursions execute without net growth of the interpreter stack. The effect of this is that a procedure call behaves like a
GOTO
and thus procedure calls can be used to implement iterations, as in PLASMA. - ^
Gerald J. Sussman, Guy L. Steele Jr.. Scheme: An Interpreter for Extended Lambda Calculus. 维基文库. 1975 (英文).
CATCH
- This is the "escape operator" which gives the user a handle on the control structure of the interpreter. The expression:
(CATCH <identifier> <expression>)
evaluates<expression>
in an environment where<identifier>
is bound to a continuation which is "just about to return from theCATCH
"; that is, if the continuation is called as a function of one argument, then control proceeds as if theCATCH
expression had returned with the supplied (evaluated) argument as its value. ……
As another example, we can define aTHROW
function, which may then be used withCATCH
much as they are in LISP:
(DEFINE THROW (LAMBDA (TAG RESULT) (TAG RESULT)))
- ^ David Madore, "call/cc mind-boggler" Portuguese Web Archive的存档,存档日期2011-01-22
- ^ Yin Wang, "Understanding the Yin-Yang Puzzle"
- ^ Richard P. Gabriel; Kent M. Pitman. Technical Issues of Separation in Function Cells and Value Cells. Lisp and Symbolic Computation. June 1988, 1 (1): 81–101 [2021-11-08]. S2CID 26716515. doi:10.1007/bf01806178. (原始内容存档于2006-11-13).
- ^ Taylor Campbell. SRFI 62: S-expression comments. The SRFI Editors, schemers.org. 2005-07-21 [2012-08-09]. (原始内容存档于2022-04-11).
- ^ Gerald J. Sussman, Guy L. Steele Jr. The Revised Report on SCHEME: A Dialect of LISP. 1978 [2021-11-07]. (原始内容存档于2022-04-06).
Syntactic Extensions - SCHEME has a syntactic extension mechanism which provides a way to define an identifier to be a magic word, and to associate a function with that word. The function accepts the magic form as an argument, and produces a new form; this new form is then evaluated in place of the original (magic) form. This is precisely the same as the MacLISP macro facility.
- ^ Kohlbecker, E.; Friedman, D. P.; Felleisen, M.; Duba, B. Hygienic Macro Expansion (PDF). ACM conference on LISP and functional programming. 1986 [2021-11-10]. (原始内容 (PDF)存档于2017-08-29).
In most current Lisp systems the expander's task is confined to the process of finding syntactic extensions and replacing them by their expansions. This implies, in particular, the each macro function is responsible for the integrity of the program. For Lisp systems (and other languages with similar macro facilities) this means specifically that variable binding must not be corrupted. This, however, is not as simple a task as it sounds. ……
The real danger of these erroneous macros is that they are treacherous. They work in all cases but one: when the user - or some other macro writer - inadvertently picks the wrong identifier name.
Various techniques have been proposed to circumvent this capturing problem, but they rely on the individual macro writer. if even one of many macro writers is negligent, the macro system becomes unsafe. We claim that the task of safely renaming macro-generated identifier is mechanical. It is essentially an α-conversion, which is knowledgeable about the origin of the identifiers. For these reasons we propose a change to the navïe macro expansion algorithm which automatically maintains hygienic conditions during expansion time.
Kohlbecker, E; Wand, M. Macro-by-example: Deriving syntactic transformations from their specifications (PDF). Symposium on Principles of Programming Languages. 1987 [2021-11-10]. (原始内容 (PDF)存档于2022-04-12). - ^ William Clinger and Jonathan Rees, editors. Revised4 Report on the Algorithmic Language Scheme. ACM Lisp Pointers. 1991, 4 (3): 1–55 [2012-08-09]. (原始内容存档于2022-01-22).
- ^ Flatt, Matthew. Binding as sets of scopes. Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages. 2016: 705–717. ISBN 978-1-4503-3549-2. S2CID 15401805. doi:10.1145/2837614.2837620.
- ^ 84.0 84.1 Philip L. Bewig. SRFI 41: Streams. The SRFI Editors, schemers.org. 2008-01-24 [2012-08-09]. (原始内容存档于2021-03-07).
- ^ Gerald J. Sussman, Guy L. Steele Jr.. Scheme: An Interpreter for Extended Lambda Calculus. 维基文库. 1975 (英文).
EVALUATE
- This is similar to the LISP functionEVAL
. It evaluates its argument, and then evaluates the resulting s-expression as SCHEME code. - ^ 86.0 86.1 The Scheme of Things: The June 1992 Meeting - Evaluating computed expressions. ACM SIGPLAN, Lisp Pointers, Volume V, Issue 4. 1992 [2021-11-12]. (原始内容存档于2022-04-04).
- ^
Gerald J. Sussman, Guy L. Steele Jr. The Revised Report on SCHEME: A Dialect of LISP. 1978 [2021-11-07]. (原始内容存档于2022-04-06).
(ENCLOSE fnrep envrep)
ENCLOSE
takes two s-expressions, one representing the code for a procedure, and the other representing the (lexical) environment in which is to run.ENCLOSE
returns a (closed) procedure which can be invoked.
The representation of the code is the standard s-expression description (a lambda-expression). the representation of the environment is an association list (a-list) of the traditional kind:
((var1 . value1) (var2 . value2) ...)
NIL
represents the global lexical environment. ……
TheEVALUATE
primitive described in [SCHEME] has been removed from the language. We discovered (the hard way) that the straightforward implementation ofEVALUATE
(evaluate the given expression in the current environment) destroys referential transparency. We then altered it to evaluate the expression in the top-level environment, but were still disturbed by the extent to which one is tied to a particular representation of a procedure to be executed. - ^ William D Clinger. SRFI 6: Basic String Ports. The SRFI Editors, schemers.org. 1999-07-01 [2012-08-09]. (原始内容存档于2021-10-21).
- ^ Scott G. Miller. SRFI 28: Basic Format Strings. The SRFI Editors, schemers.org. 2002-06-25 [2012-08-09]. (原始内容存档于2022-05-08).
- ^ Scheme Systems Supporting SRFIs. The SRFI Editors, schemers.org. 2009-08-30 [2012-08-09]. (原始内容存档于2021-06-20).
- ^ Scheme Benchmarks. [2022-11-14]. (原始内容存档于2022-11-04).
- ^ Brian Harvey, Matthew Wright. Simply Scheme: Introducing Computer Science. 1999 [2021-11-07]. (原始内容存档于2022-05-04).
- ^ Eric Grimson. 6.001 Structure and Interpretation of Computer Programs. MIT Open Courseware. Spring 2005 [2009-10-20]. (原始内容存档于2009-10-01).
- ^ Alex Vandiver, Nelson Elhage; et al. 6.184 - Zombies drink caffeinated 6.001. MIT CSAIL. January 2009 [2009-10-20]. (原始内容存档于2009-08-28).
- ^ Brian Harvey. CS 61A Structure and Interpretation of Computer Programs. UC Berkeley Webcasts. Spring 2011 [2011-12-18]. (原始内容存档于2016-03-16).
- ^ John DeNero. CS 61A Structure and Interpretation of Computer Programs. UC Berkeley. Fall 2011 [2011-12-18]. (原始内容存档于2011-12-29).
- ^ Dov Grobgeld. The GIMP Basic Scheme Tutorial. The GIMP Team. 2002 [2009-10-20]. (原始内容存档于2010-01-24).
- ^ guile-gnome. Free Software Foundation. [2009-10-20]. (原始内容存档于2016-03-04).
延伸阅读
编辑外部链接
编辑- 维基教科书中有关Scheme Programming的文本
- 维基教科书中有关Write Yourself a Scheme in 48 Hours的文本
- 维基共享资源上的相关多媒体资源:Scheme