16.5.3. Memory usage
NFS uses the server's page
cache (in SunOS 4.x, Solaris and System
V Release 4) for file blocks read in NFS read
requests. Because these systems implement page mapping, the NFS
server will use available page frames to cache file pages, and
use
the buffer
cache[48] to store UFS
inode and file metadata (direct and indirect blocks).
In Solaris, you can view the buffer cache statistics by using
sar -b. This will show you the number of data
transfers per second between system buffers and disk
(bread/s & bwrite/s),
the number of accesses to the system buffers (logical reads and
writes identified by lread/s &
lwrit/s), the cache hit ratios
(%rcache & %wcache),
and the number of physical reads and writes using the raw device
mechanism (pread/s &
pwrit/s):
# sar -b 20 5
SunOS bunker 5.8 Generic sun4u 12/06/2000
10:39:01 bread/s lread/s %rcache bwrit/s lwrit/s %wcache pread/s pwrit/s
10:39:22 19 252 93 34 103 67 0 0
10:39:43 21 612 97 46 314 85 0 0
10:40:03 20 430 95 35 219 84 0 0
10:40:24 35 737 95 49 323 85 0 0
10:40:45 21 701 97 60 389 85 0 0
Average 23 546 96 45 270 83 0 0
In practice, a cache hit ratio of 100% is hard to achieve due to lack
of access locality by the NFS clients, consequently a cache hit ratio
of around 90% is considered acceptable. By default, Solaris grows the
dynamically sized buffer cache, as needed, until it reaches a high
watermark specified by the bufhwm kernel
parameter. By default, Solaris limits this value to 2% of physical
memory in the system. In most cases, this 2%[49] ceiling is more
than enough since the buffer cache is only used to cache inode and
metadata information. You can use the
sysdef
command to view its value:
# sysdef
...
*
* Tunable Parameters
*
41385984 maximum memory allowed in buffer cache (bufhwm)
...
If you need to modify the default value of
bufhwm, set its new value in
/etc/system, or use adb as
described in Chapter 15, "Debugging Network Problems".
The actual file contents are cached in the page cache, and by default
the filesystem will cache as many pages as possible. There is no high
watermark, potentially causing the page cache to grow and consume all
available memory. This means that all process memory that has not
been used recently by local applications may be reclaimed for use by
the filesystem page cache, possibly causing local processes to page
excessively.
If the server is used for non-NFS purposes, enable priority paging to
ensure that it has enough memory to run all of its processes without
paging. Priority paging prevents the filesystem from consuming
excessive memory by limiting the file cache so that filesystem I/O
does not cause unnecessary paging of applications. The filesystem can
still grow to use free memory, but cannot take memory from other
applications on the system. Enable priority paging by adding the
following line to
/etc/system and reboot:
*
* Enable Priority Paging
*
set priority_paging=1
Priority paging can also be enabled on a live system. Refer to the
excellent
Solaris Internals book written by
Mauro and McDougall and published by Sun Microsystems Press for an
in-depth explanation of Priority Paging and File System Caching in
Solaris. The following procedure for enabling priority paging on a
live 64-bit system originally appeared on their book:
# adb -kw /dev/ksyms /dev/mem
physmem 3ac8
lotsfree/E
lotsfree:
lotsfree: 234 /* value of lotsfree is printed */
cachefree/Z 0t468 /* set to twice the value of lotsfree */
cachefree: ea = 1d4
dyncachefree/Z 0t468 /* set to twice the value of lotsfree */
dyncachefree: ea = 1d4
cachefree/E
cachefree:
cachefree: 468
dyncachefree/E
dyncachefree:
dyncachefree: 468
Setting
priority_ paging=1 in
/etc/system causes a new memory tunable,
cachefree, to be set to twice the old paging
high watermark,
lotsfree, when the system boots.
The previous
adb procedure does the equivalent
work on a live system.
cachefree scales
proportionally to other memory parameters used by the Solaris Virtual
Memory System. Again, refer to the
Solaris
Internals book for an in-depth explanation. The same
adb procedure can be performed on a 32-bit
system by replacing the
/E directives with
/D to print the
value of a 32-bit quantity and
/Z with /W to set the value
of the 32-bit quantity.